Follow-on fix for bug 457825. Use sheet principal for agent and user sheets. r=dbaron...
[wine-gecko.git] / content / media / video / public / nsOggDecoder.h
blob074207911b405236fb22b8753670843107b5cc4b
1 /* -*- Mode: C++; tab-width: 2; indent-tabs-mode: nil; c-basic-offset: 2 -*- */
2 /* vim:set ts=2 sw=2 sts=2 et cindent: */
3 /* ***** BEGIN LICENSE BLOCK *****
4 * Version: ML 1.1/GPL 2.0/LGPL 2.1
6 * The contents of this file are subject to the Mozilla Public License Version
7 * 1.1 (the "License"); you may not use this file except in compliance with
8 * the License. You may obtain a copy of the License at
9 * http://www.mozilla.org/MPL/
11 * Software distributed under the License is distributed on an "AS IS" basis,
12 * WITHOUT WARRANTY OF ANY KIND, either express or implied. See the License
13 * for the specific language governing rights and limitations under the
14 * License.
16 * The Original Code is Mozilla code.
18 * The Initial Developer of the Original Code is the Mozilla Corporation.
19 * Portions created by the Initial Developer are Copyright (C) 2007
20 * the Initial Developer. All Rights Reserved.
22 * Contributor(s):
23 * Chris Double <chris.double@double.co.nz>
25 * Alternatively, the contents of this file may be used under the terms of
26 * either the GNU General Public License Version 2 or later (the "GPL"), or
27 * the GNU Lesser General Public License Version 2.1 or later (the "LGPL"),
28 * in which case the provisions of the GPL or the LGPL are applicable instead
29 * of those above. If you wish to allow use of your version of this file only
30 * under the terms of either the GPL or the LGPL, and not to allow others to
31 * use your version of this file under the terms of the MPL, indicate your
32 * decision by deleting the provisions above and replace them with the notice
33 * and other provisions required by the GPL or the LGPL. If you do not delete
34 * the provisions above, a recipient may use your version of this file under
35 * the terms of any one of the MPL, the GPL or the LGPL.
37 * ***** END LICENSE BLOCK ***** */
39 Each video element has one thread. This thread, called the Decode thread,
40 owns the resources for downloading and reading the video file. It goes through the
41 file, decoding the theora and vorbis data. It uses Oggplay to do the decoding.
42 It indirectly uses an nsMediaStream to do the file reading and seeking via Oggplay.
43 All file reads and seeks must occur on this thread only. It handles the sending
44 of the audio data to the sound device and the presentation of the video data
45 at the correct frame rate.
47 When the decode thread is created an event is dispatched to it. The event
48 runs for the lifetime of the playback of the resource. The decode thread
49 synchronises with the main thread via a single monitor held by the
50 nsOggDecoder object.
52 The event contains a Run method which consists of an infinite loop
53 that checks the state that the state machine is in and processes
54 operations on that state.
56 The nsOggDecodeStateMachine class is the event that gets dispatched to
57 the decode thread. It has the following states:
59 DECODING_METADATA
60 The Ogg headers are being loaded, and things like framerate, etc are
61 being decoded.
62 DECODING_FIRSTFRAME
63 The first frame of audio/video data is being decoded.
64 DECODING
65 Video/Audio frames are being decoded.
66 SEEKING
67 A seek operation is in progress.
68 BUFFERING
69 Decoding is paused while data is buffered for smooth playback.
70 COMPLETED
71 The resource has completed decoding.
72 SHUTDOWN
73 The decoder object is about to be destroyed.
75 The following result in state transitions.
77 Shutdown()
78 Clean up any resources the nsOggDecodeStateMachine owns.
79 Decode()
80 Start decoding video frames.
81 Buffer
82 This is not user initiated. It occurs when the
83 available data in the stream drops below a certain point.
84 Complete
85 This is not user initiated. It occurs when the
86 stream is completely decoded.
87 Seek(float)
88 Seek to the time position given in the resource.
90 A state transition diagram:
92 DECODING_METADATA
93 | | Shutdown()
94 v -->-------------------->--------------------------|
95 | |
96 DECODING_FIRSTFRAME v
97 | | Shutdown() |
98 v >-------------------->--------------------------|
99 | |------------->----->------------------------| v
100 DECODING | | | | |
101 ^ v Seek(t) | | | |
102 | Decode() | v | | |
103 ^-----------<----SEEKING | v Complete v v
104 | | | | | |
105 | | | COMPLETED SHUTDOWN-<-|
106 ^ ^ | |Shutdown() |
107 | | | >-------->-----^
108 | Decode() |Seek(t) |Buffer() |
109 -----------<--------<-------BUFFERING |
111 v Shutdown() |
113 ------------>-----|
115 The Main thread controls the decode state machine by setting the value
116 of a mPlayState variable and notifying on the monitor
117 based on the high level player actions required (Seek, Pause, Play, etc).
119 The player states are the states requested by the client through the
120 DOM API. They represent the desired state of the player, while the
121 decoder's state represents the actual state of the decoder.
123 The high level state of the player is maintained via a PlayState value.
124 It can have the following states:
126 START
127 The decoder has been initialized but has no resource loaded.
128 PAUSED
129 A request via the API has been received to pause playback.
130 LOADING
131 A request via the API has been received to load a resource.
132 PLAYING
133 A request via the API has been received to start playback.
134 SEEKING
135 A request via the API has been received to start seeking.
136 COMPLETED
137 Playback has completed.
138 SHUTDOWN
139 The decoder is about to be destroyed.
141 State transition occurs when the Media Element calls the Play, Seek,
142 etc methods on the nsOggDecoder object. When the transition occurs
143 nsOggDecoder then calls the methods on the decoder state machine
144 object to cause it to behave appropriate to the play state.
146 The following represents the states that the player can be in, and the
147 valid states the decode thread can be in at that time:
149 player LOADING decoder DECODING_METADATA, DECODING_FIRSTFRAME
150 player PLAYING decoder DECODING, BUFFERING, SEEKING, COMPLETED
151 player PAUSED decoder DECODING, BUFFERING, SEEKING, COMPLETED
152 player SEEKING decoder SEEKING
153 player COMPLETED decoder SHUTDOWN
154 player SHUTDOWN decoder SHUTDOWN
156 The general sequence of events with these objects is:
158 1) The video element calls Load on nsMediaDecoder. This creates the
159 decode thread and starts the channel for downloading the file. It
160 instantiates and starts the Decode state machine. The high level
161 LOADING state is entered, which results in the decode state machine
162 to start decoding metadata. These are the headers that give the
163 video size, framerate, etc. It returns immediately to the calling
164 video element.
166 2) When the Ogg metadata has been loaded by the decode thread it will
167 call a method on the video element object to inform it that this
168 step is done, so it can do the things required by the video
169 specification at this stage. The decoder then continues to decode
170 the first frame of data.
172 3) When the first frame of Ogg data has been successfully decoded it
173 calls a method on the video element object to inform it that this
174 step has been done, once again so it can do the required things by
175 the video specification at this stage.
177 This results in the high level state changing to PLAYING or PAUSED
178 depending on any user action that may have occurred.
180 The decode thread, while in the DECODING state, plays audio and
181 video, if the correct frame time comes around and the decoder
182 play state is PLAYING.
184 a/v synchronisation is done by a combination of liboggplay and the
185 Decoder state machine. liboggplay ensures that a decoded frame of data
186 has both the audio samples and the YUV data for that period of time.
188 When a frame is decoded by the decode state machine it converts the
189 YUV encoded video to RGB and copies the sound data to an internal
190 FrameData object. This is stored in a queue of available decoded frames.
191 Included in the FrameData object is the time that that frame should
192 be displayed.
194 The display state machine keeps track of the time since the last frame it
195 played. After decoding a frame it checks if it is time to display the next
196 item in the decoded frame queue. If so, it pops the item off the queue
197 and displays it.
199 Ideally a/v sync would take into account the actual audio clock of the
200 audio hardware for the sync rather than using the system clock.
201 Unfortunately getting valid time data out of the audio hardware has proven
202 to be unreliable across platforms (and even distributions in Linux) depending
203 on audio hardware, audio backend etc. The current approach works fine in practice
204 and is a compromise until this issue can be sorted. The plan is to eventually
205 move to synchronising using the audio hardware.
207 To prevent audio skipping and framerate dropping it is very important to
208 make sure no blocking occurs during the decoding process and minimise
209 expensive time operations at the time a frame is to be displayed. This is
210 managed by immediately converting video data to RGB on decode (an expensive
211 operation to do at frame display time) and checking if the sound device will
212 not block before writing sound data to it.
214 Shutdown needs to ensure that the event posted to the decode
215 thread is completed. The decode thread can potentially block internally
216 inside liboggplay when reading, seeking, or its internal buffers containing
217 decoded data are full. When blocked in this manner a call from the main thread
218 to Shutdown() will hang.
220 This is fixed with a protocol to ensure that the decode event cleanly
221 completes. The nsMediaStream that the nsChannelReader uses has a
222 Cancel() method. Calling this before Shutdown() will close any
223 internal streams or listeners resulting in blocked i/o completing with
224 an error, and all future i/o on the stream having an error.
226 This causes the decode thread to exit and Shutdown() can occur.
228 If the decode thread is seeking then the same Cancel() operation
229 causes an error to be returned from the seek call to liboggplay which
230 exits out of the seek operation, and stops the seek state running on the
231 decode thread.
233 If the decode thread is blocked due to internal decode buffers being
234 full, it is unblocked during the shutdown process by calling
235 oggplay_prepare_for_close.
237 In practice the OggPlay internal buffer should never fill as we retrieve and
238 process the frame immediately on decoding.
240 The Shutdown method on nsOggDecoder can spin the event loop as it waits
241 for threads to complete. Spinning the event loop is a bad thing to happen
242 during certain times like destruction of the media element. To work around
243 this the Shutdown method does nothing by queue an event to the main thread
244 to perform the actual Shutdown. This way the shutdown can occur at a safe
245 time.
247 This means the owning object of a nsOggDecoder object *MUST* call Shutdown
248 when destroying the nsOggDecoder object.
250 #if !defined(nsOggDecoder_h_)
251 #define nsOggDecoder_h_
253 #include "nsISupports.h"
254 #include "nsCOMPtr.h"
255 #include "nsIThread.h"
256 #include "nsIChannel.h"
257 #include "nsChannelReader.h"
258 #include "nsIObserver.h"
259 #include "nsIFrame.h"
260 #include "nsAutoPtr.h"
261 #include "nsSize.h"
262 #include "prlog.h"
263 #include "prmon.h"
264 #include "gfxContext.h"
265 #include "gfxRect.h"
266 #include "oggplay/oggplay.h"
267 #include "nsMediaDecoder.h"
269 class nsAudioStream;
270 class nsOggDecodeStateMachine;
272 class nsOggDecoder : public nsMediaDecoder
274 friend class nsOggDecodeStateMachine;
276 // ISupports
277 NS_DECL_ISUPPORTS
279 // nsIObserver
280 NS_DECL_NSIOBSERVER
282 public:
283 // Enumeration for the valid play states (see mPlayState)
284 enum PlayState {
285 PLAY_STATE_START,
286 PLAY_STATE_LOADING,
287 PLAY_STATE_PAUSED,
288 PLAY_STATE_PLAYING,
289 PLAY_STATE_SEEKING,
290 PLAY_STATE_ENDED,
291 PLAY_STATE_SHUTDOWN
294 nsOggDecoder();
295 ~nsOggDecoder();
296 PRBool Init();
298 // This method must be called by the owning object before that
299 // object disposes of this decoder object.
300 virtual void Shutdown();
302 virtual float GetCurrentTime();
304 virtual nsresult Load(nsIURI* aURI,
305 nsIChannel* aChannel,
306 nsIStreamListener **aListener);
308 // Start playback of a video. 'Load' must have previously been
309 // called.
310 virtual nsresult Play();
312 // Stop playback of a video, and stop download of video stream.
313 virtual void Stop();
315 // Seek to the time position in (seconds) from the start of the video.
316 virtual nsresult Seek(float time);
318 virtual nsresult PlaybackRateChanged();
320 virtual void Pause();
321 virtual float GetVolume();
322 virtual void SetVolume(float volume);
323 virtual float GetDuration();
325 virtual void GetCurrentURI(nsIURI** aURI);
326 virtual nsIPrincipal* GetCurrentPrincipal();
328 virtual void UpdateBytesDownloaded(PRUint64 aBytes);
330 // Called when the video file has completed downloading.
331 // Call on the main thread only.
332 void ResourceLoaded();
334 // Called if the media file encounters a network error.
335 // Call on the main thread only.
336 virtual void NetworkError();
338 // Call from any thread safely. Return PR_TRUE if we are currently
339 // seeking in the media resource.
340 virtual PRBool IsSeeking() const;
342 // Get the size of the media file in bytes. Called on the main thread only.
343 virtual void SetTotalBytes(PRInt64 aBytes);
345 // Set a flag indicating whether seeking is supported
346 virtual void SetSeekable(PRBool aSeekable);
348 // Return PR_TRUE if seeking is supported.
349 virtual PRBool GetSeekable();
351 // Returns the channel reader.
352 nsChannelReader* GetReader() { return mReader; }
354 protected:
356 // Returns the monitor for other threads to synchronise access to
357 // state.
358 PRMonitor* GetMonitor()
360 return mMonitor;
363 // Return the current state. Can be called on any thread. If called from
364 // a non-main thread, the decoder monitor must be held.
365 PlayState GetState()
367 return mPlayState;
370 /******
371 * The following methods must only be called on the main
372 * thread.
373 ******/
375 // Change to a new play state. This updates the mState variable and
376 // notifies any thread blocking on this object's monitor of the
377 // change. Call on the main thread only.
378 void ChangeState(PlayState aState);
380 // Called when the metadata from the Ogg file has been read.
381 // Call on the main thread only.
382 void MetadataLoaded();
384 // Called when the first frame has been loaded.
385 // Call on the main thread only.
386 void FirstFrameLoaded();
388 // Called when the video has completed playing.
389 // Call on the main thread only.
390 void PlaybackEnded();
392 // Return the current number of bytes loaded from the video file.
393 // This is used for progress events.
394 virtual PRUint64 GetBytesLoaded();
396 // Return the size of the video file in bytes.
397 // This is used for progress events.
398 virtual PRInt64 GetTotalBytes();
400 // Buffering of data has stopped. Inform the element on the main
401 // thread.
402 void BufferingStopped();
404 // Buffering of data has started. Inform the element on the main
405 // thread.
406 void BufferingStarted();
408 // Seeking has stopped. Inform the element on the main
409 // thread.
410 void SeekingStopped();
412 // Seeking has started. Inform the element on the main
413 // thread.
414 void SeekingStarted();
416 // Called when the backend has changed the current playback
417 // position. It dispatches a timeupdate event and invalidates the frame.
418 // This must be called on the main thread only.
419 void PlaybackPositionChanged();
421 private:
422 // Register/Unregister with Shutdown Observer.
423 // Call on main thread only.
424 void RegisterShutdownObserver();
425 void UnregisterShutdownObserver();
427 /******
428 * The following members should be accessed on the main thread only
429 ******/
430 // Total number of bytes downloaded so far.
431 PRUint64 mBytesDownloaded;
433 // The URI of the current resource
434 nsCOMPtr<nsIURI> mURI;
436 // Thread to handle decoding of Ogg data.
437 nsCOMPtr<nsIThread> mDecodeThread;
439 // The current playback position of the media resource in units of
440 // seconds. This is updated approximately at the framerate of the
441 // video (if it is a video) or the callback period of the audio.
442 // It is read and written from the main thread only.
443 float mCurrentTime;
445 // Volume that playback should start at. 0.0 = muted. 1.0 = full
446 // volume. Readable/Writeable from the main thread. Read from the
447 // audio thread when it is first started to get the initial volume
448 // level.
449 float mInitialVolume;
451 // Position to seek to when the seek notification is received by the
452 // decoding thread. Written by the main thread and read via the
453 // decoding thread. Synchronised using mPlayStateMonitor. If the
454 // value is negative then no seek has been requested. When a seek is
455 // started this is reset to negative.
456 float mRequestedSeekTime;
458 // Size of the media file in bytes. Set on the first non-byte range
459 // HTTP request from nsChannelToPipe Listener. Accessed on the
460 // main thread only.
461 PRInt64 mContentLength;
463 // Duration of the media resource. Set to -1 if unknown.
464 // Set when the Ogg metadata is loaded. Accessed on the main thread
465 // only.
466 PRInt64 mDuration;
468 // True if we are registered with the observer service for shutdown.
469 PRPackedBool mNotifyOnShutdown;
471 // True if the media resource is seekable (server supports byte range
472 // requests).
473 PRPackedBool mSeekable;
475 /******
476 * The following member variables can be accessed from any thread.
477 ******/
479 // The state machine object for handling the decoding via
480 // oggplay. It is safe to call methods of this object from other
481 // threads. Its internal data is synchronised on a monitor. The
482 // lifetime of this object is after mPlayState is LOADING and before
483 // mPlayState is SHUTDOWN. It is safe to access it during this
484 // period.
485 nsCOMPtr<nsOggDecodeStateMachine> mDecodeStateMachine;
487 // OggPlay object used to read data from a channel. Created on main
488 // thread. Passed to liboggplay and the locking for multithreaded
489 // access is handled by that library. Some methods are called from
490 // the decoder thread, and the state machine for that thread keeps
491 // a pointer to this reader. This is safe as the only methods called
492 // are threadsafe (via the threadsafe nsMediaStream).
493 nsAutoPtr<nsChannelReader> mReader;
495 // Monitor for detecting when the video play state changes. A call
496 // to Wait on this monitor will block the thread until the next
497 // state change.
498 PRMonitor* mMonitor;
500 // Set to one of the valid play states. It is protected by the
501 // monitor mMonitor. This monitor must be acquired when reading or
502 // writing the state. Any change to the state on the main thread
503 // must call NotifyAll on the monitor so the decode thread can wake up.
504 PlayState mPlayState;
506 // The state to change to after a seek or load operation. It must only
507 // be changed from the main thread. The decoder monitor must be acquired
508 // when writing to the state, or when reading from a non-main thread.
509 // Any change to the state must call NotifyAll on the monitor.
510 PlayState mNextState;
512 // Flags if we've called Stop(). Prevents multiple events being
513 // sent to call Shutdown(). Accessed on the main thread
514 // only.
515 PRPackedBool mIsStopping;
518 #endif