Issue 2067103002: Avoid unnecessary HW video encoder reconfiguration

skvlad

skvlad@webrtc.org changed reviewers: + henrika@webrtc.org, pbos@webrtc.org, pthatcher@webrtc.org

4 years, 6 months ago (2016-06-14 23:09:12 UTC) #1

skvlad

Description was changed from ========== Avoid unnecessary HW video encoder reconfiguration This change reduces the ...

4 years, 6 months ago (2016-06-15 02:11:23 UTC) #3

Description was changed from

==========
Avoid unnecessary HW video encoder reconfiguration

This change reduces the number of times the Android hardware video
encoder is reconfigured when making an outgoing call. With this change
and the fix for requesting the correct frame orientation (coming in a
separate CL), the encoder should only be initialized once as opposed to
the ~3 times it happens currently.

Before the fix, the following sequence of events caused the extra
reconfigurations:
 1. After the SetLocalDescription call, the WebRtcVideoSendStream is created.
  All frames from the camera are dropped until the corresponding VideoSendStream
is created.
 2. SetRemoteDescription() triggers the VideoSendStream creation. At
 this point, the encoder is configured for the first time, with the
 frame dimensions set to a low resolution default (176x144).
 3. When the first video frame is received from the camera after the
 VideoSendStreamIsCreated, the encoder is reconfigured to the correct
 dimensions. If we are using the Android hardware encoder, the default
 configuration is set to encode from a memory buffer (use_surface=false).
 4. When the frame is passed down to the encoder in
 androidmediaencoder_jni.cc EncodeOnCodecThread(), it may be stored in
 a texture instead of a memory buffer. In this case, yet another
 reconfiguration takes place to enable encoding from a texture.

The fix makes the following changes:
 1. WebRtcVideoSendStream::OnFrame() now caches the last seen frame
 dimensions, and whether the frame was stored in a texture.
 2. When the encoder is configured the first time
 (WebRtcVideoSendStream::SetCodec()) - the last seen frame dimensions
 are used instead of the default dimensions.
 3. A flag that indicates if encoding is to be done from a texture has
 been added to the webrtc::VideoStream and webrtc::VideoCodec structs,
 and it's been wired up to be passed down all the way to the JNI code in
 androidmediaencoder_jni.cc.
 4. MediaCodecVideoEncoder::InitEncode is now reading the is_surface
 flag from the VideoCodec structure instead of guessing the default as
 false. This way we end up with the correct encoder configuration the
 first time around.

Even with this fix, the encoder can be reconfigured twice - we can
learn that the remote side supports the rotation RTP header extension,
and configure the video source to send un-rotated frames to the encoder.
This may cause another resolution change if the first frame was rotated
(e.g. making a call from a cell phone in portrait orientation). I'm
going to address this in a separate CL, as it touches a different set of
files.

BUG=
==========

to

==========
Avoid unnecessary HW video encoder reconfiguration

This change reduces the number of times the Android hardware video
encoder is reconfigured when making an outgoing call. With this change, 
the encoder should only be initialized once as opposed to the ~3 times
it happens currently.

Before the fix, the following sequence of events caused the extra
reconfigurations:

 1. After the SetLocalDescription call, the WebRtcVideoSendStream is created.
    All frames from the camera are dropped until the corresponding 
    VideoSendStream is created.

 2. SetRemoteDescription() triggers the VideoSendStream creation. At
    this point, the encoder is configured for the first time, with the
    frame dimensions set to a low resolution default (176x144).

 3. When the first video frame is received from the camera after the
    VideoSendStreamIsCreated, the encoder is reconfigured to the correct
    dimensions. If we are using the Android hardware encoder, the default
    configuration is set to encode from a memory buffer (use_surface=false).

 4. When the frame is passed down to the encoder in
    androidmediaencoder_jni.cc EncodeOnCodecThread(), it may be stored in
    a texture instead of a memory buffer. In this case, yet another
    reconfiguration takes place to enable encoding from a texture.

 5. Even if the resolution and texture flag were known at the start of 
    the call, there would be a reconfiguration involved if the camera is
    rotated (such as when making a call from a phone in portrait orientation).
    The reason for that is that at construction time, WebRtcVideoEngine2 
    sets the VideoSinkWants structure parameter to request frames rotated
    by the source; the early frames will then arrive in portrait resolution. 
    When the remote description is finally set, if the rotation RTP extension 
    is supported by the remote receiver, the source is asked to provide
    non-rotated frames. The very next frame will then arrive in landscape 
    resolution with a non-zero rotation value to be applied by the receiver. 
    Since the encoder was configured with the last (portrait) frame size, 
    it's going to need to be reconfigured again.

The fix makes the following changes:

 1. WebRtcVideoSendStream::OnFrame() now caches the last seen frame
    dimensions, and whether the frame was stored in a texture.

 2. When the encoder is configured the first time
    (WebRtcVideoSendStream::SetCodec()) - the last seen frame dimensions
    are used instead of the default dimensions.

 3. A flag that indicates if encoding is to be done from a texture has
    been added to the webrtc::VideoStream and webrtc::VideoCodec structs,
    and it's been wired up to be passed down all the way to the JNI code in
    androidmediaencoder_jni.cc.

 4. MediaCodecVideoEncoder::InitEncode is now reading the is_surface
    flag from the VideoCodec structure instead of guessing the default as
    false. This way we end up with the correct encoder configuration the
    first time around.

 5. WebRtcVideoSendStream now takes an optimistic guess and requests non-
    rotated frames when the supported RtpExtensions list is not available. 
    This makes the "early" frames arrive non-rotated, and the cached dimensions
    will be correct for the common case when the rotation extension is
supported.
    If the other side is an older endpoint which does not support rotation, 
    the encoder will have to be reconfigured - but it's better to penalize the 
    uncommon case rather than the common one.
==========

henrika_webrtc

henrika@webrtc.org changed reviewers: + perkj@webrtc.org - henrika@webrtc.org, pthatcher@webrtc.org

4 years, 6 months ago (2016-06-15 08:11:52 UTC) #4

henrika_webrtc

Removed myself and pthatcher as reviewers. Added perkj@ instead. I don't know the video-related parts ...

4 years, 6 months ago (2016-06-15 08:14:03 UTC) #5

perkj_webrtc

perkj@webrtc.org changed reviewers: + pthatcher@webrtc.org

4 years, 6 months ago (2016-06-15 11:31:29 UTC) #6

perkj_webrtc

nice. Just nits. https://codereview.webrtc.org/2067103002/diff/20001/webrtc/common_types.h File webrtc/common_types.h (right): https://codereview.webrtc.org/2067103002/diff/20001/webrtc/common_types.h#newcode708 webrtc/common_types.h:708: bool encodeFromTexture; encode_from_texture please. We should ...

4 years, 6 months ago (2016-06-15 11:31:30 UTC) #7

skvlad

skvlad@webrtc.org changed reviewers: + tina.legrand@webrtc.org, tommi@webrtc.org

4 years, 6 months ago (2016-06-15 19:44:33 UTC) #9

skvlad

Added Tina and Tommi as reviewers for webrtc/config.h and webrtc/common_types.h. https://codereview.webrtc.org/2067103002/diff/20001/webrtc/common_types.h File webrtc/common_types.h (right): https://codereview.webrtc.org/2067103002/diff/20001/webrtc/common_types.h#newcode708 ...

4 years, 6 months ago (2016-06-15 19:44:34 UTC) #10

perkj_webrtc

perkj@webrtc.org changed reviewers: + mflodman@webrtc.org - tina.legrand@webrtc.org

4 years, 6 months ago (2016-06-15 20:18:53 UTC) #11

pthatcher1

Just style and readability stuff. The logic seem sound. It's a great change, by the ...

4 years, 6 months ago (2016-06-15 20:40:23 UTC) #13

tommi

nice! lgtm once current comments have been addressed. https://codereview.webrtc.org/2067103002/diff/60001/webrtc/api/java/jni/androidmediaencoder_jni.cc File webrtc/api/java/jni/androidmediaencoder_jni.cc (right): https://codereview.webrtc.org/2067103002/diff/60001/webrtc/api/java/jni/androidmediaencoder_jni.cc#newcode420 webrtc/api/java/jni/androidmediaencoder_jni.cc:420: codec_settings->encode_from_texture ...

4 years, 6 months ago (2016-06-15 21:00:39 UTC) #14

skvlad

https://codereview.webrtc.org/2067103002/diff/60001/webrtc/api/java/jni/androidmediaencoder_jni.cc File webrtc/api/java/jni/androidmediaencoder_jni.cc (right): https://codereview.webrtc.org/2067103002/diff/60001/webrtc/api/java/jni/androidmediaencoder_jni.cc#newcode420 webrtc/api/java/jni/androidmediaencoder_jni.cc:420: codec_settings->encode_from_texture /* use_surface */)); On 2016/06/15 21:00:39, tommi-webrtc wrote: ...

4 years, 6 months ago (2016-06-15 22:10:36 UTC) #15

skvlad

The patchset sent to the CQ was uploaded after l-g-t-m from tommi@webrtc.org, perkj@webrtc.org Link to ...

4 years, 6 months ago (2016-06-16 18:36:19 UTC) #18

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/2067103002/90001

4 years, 6 months ago (2016-06-16 18:36:22 UTC) #19

commit-bot: I haz the power

Description was changed from ========== Avoid unnecessary HW video encoder reconfiguration This change reduces the ...

4 years, 6 months ago (2016-06-16 19:08:07 UTC) #20

Message was sent while issue was closed.

Description was changed from

==========
Avoid unnecessary HW video encoder reconfiguration

This change reduces the number of times the Android hardware video
encoder is reconfigured when making an outgoing call. With this change, 
the encoder should only be initialized once as opposed to the ~3 times
it happens currently.

Before the fix, the following sequence of events caused the extra
reconfigurations:

 1. After the SetLocalDescription call, the WebRtcVideoSendStream is created.
    All frames from the camera are dropped until the corresponding 
    VideoSendStream is created.

 2. SetRemoteDescription() triggers the VideoSendStream creation. At
    this point, the encoder is configured for the first time, with the
    frame dimensions set to a low resolution default (176x144).

 3. When the first video frame is received from the camera after the
    VideoSendStreamIsCreated, the encoder is reconfigured to the correct
    dimensions. If we are using the Android hardware encoder, the default
    configuration is set to encode from a memory buffer (use_surface=false).

 4. When the frame is passed down to the encoder in
    androidmediaencoder_jni.cc EncodeOnCodecThread(), it may be stored in
    a texture instead of a memory buffer. In this case, yet another
    reconfiguration takes place to enable encoding from a texture.

 5. Even if the resolution and texture flag were known at the start of 
    the call, there would be a reconfiguration involved if the camera is
    rotated (such as when making a call from a phone in portrait orientation).
    The reason for that is that at construction time, WebRtcVideoEngine2 
    sets the VideoSinkWants structure parameter to request frames rotated
    by the source; the early frames will then arrive in portrait resolution. 
    When the remote description is finally set, if the rotation RTP extension 
    is supported by the remote receiver, the source is asked to provide
    non-rotated frames. The very next frame will then arrive in landscape 
    resolution with a non-zero rotation value to be applied by the receiver. 
    Since the encoder was configured with the last (portrait) frame size, 
    it's going to need to be reconfigured again.

The fix makes the following changes:

 1. WebRtcVideoSendStream::OnFrame() now caches the last seen frame
    dimensions, and whether the frame was stored in a texture.

 2. When the encoder is configured the first time
    (WebRtcVideoSendStream::SetCodec()) - the last seen frame dimensions
    are used instead of the default dimensions.

 3. A flag that indicates if encoding is to be done from a texture has
    been added to the webrtc::VideoStream and webrtc::VideoCodec structs,
    and it's been wired up to be passed down all the way to the JNI code in
    androidmediaencoder_jni.cc.

 4. MediaCodecVideoEncoder::InitEncode is now reading the is_surface
    flag from the VideoCodec structure instead of guessing the default as
    false. This way we end up with the correct encoder configuration the
    first time around.

 5. WebRtcVideoSendStream now takes an optimistic guess and requests non-
    rotated frames when the supported RtpExtensions list is not available. 
    This makes the "early" frames arrive non-rotated, and the cached dimensions
    will be correct for the common case when the rotation extension is
supported.
    If the other side is an older endpoint which does not support rotation, 
    the encoder will have to be reconfigured - but it's better to penalize the 
    uncommon case rather than the common one.
==========

to

==========
Avoid unnecessary HW video encoder reconfiguration

This change reduces the number of times the Android hardware video
encoder is reconfigured when making an outgoing call. With this change, 
the encoder should only be initialized once as opposed to the ~3 times
it happens currently.

Before the fix, the following sequence of events caused the extra
reconfigurations:

 1. After the SetLocalDescription call, the WebRtcVideoSendStream is created.
    All frames from the camera are dropped until the corresponding 
    VideoSendStream is created.

 2. SetRemoteDescription() triggers the VideoSendStream creation. At
    this point, the encoder is configured for the first time, with the
    frame dimensions set to a low resolution default (176x144).

 3. When the first video frame is received from the camera after the
    VideoSendStreamIsCreated, the encoder is reconfigured to the correct
    dimensions. If we are using the Android hardware encoder, the default
    configuration is set to encode from a memory buffer (use_surface=false).

 4. When the frame is passed down to the encoder in
    androidmediaencoder_jni.cc EncodeOnCodecThread(), it may be stored in
    a texture instead of a memory buffer. In this case, yet another
    reconfiguration takes place to enable encoding from a texture.

 5. Even if the resolution and texture flag were known at the start of 
    the call, there would be a reconfiguration involved if the camera is
    rotated (such as when making a call from a phone in portrait orientation).
    The reason for that is that at construction time, WebRtcVideoEngine2 
    sets the VideoSinkWants structure parameter to request frames rotated
    by the source; the early frames will then arrive in portrait resolution. 
    When the remote description is finally set, if the rotation RTP extension 
    is supported by the remote receiver, the source is asked to provide
    non-rotated frames. The very next frame will then arrive in landscape 
    resolution with a non-zero rotation value to be applied by the receiver. 
    Since the encoder was configured with the last (portrait) frame size, 
    it's going to need to be reconfigured again.

The fix makes the following changes:

 1. WebRtcVideoSendStream::OnFrame() now caches the last seen frame
    dimensions, and whether the frame was stored in a texture.

 2. When the encoder is configured the first time
    (WebRtcVideoSendStream::SetCodec()) - the last seen frame dimensions
    are used instead of the default dimensions.

 3. A flag that indicates if encoding is to be done from a texture has
    been added to the webrtc::VideoStream and webrtc::VideoCodec structs,
    and it's been wired up to be passed down all the way to the JNI code in
    androidmediaencoder_jni.cc.

 4. MediaCodecVideoEncoder::InitEncode is now reading the is_surface
    flag from the VideoCodec structure instead of guessing the default as
    false. This way we end up with the correct encoder configuration the
    first time around.

 5. WebRtcVideoSendStream now takes an optimistic guess and requests non-
    rotated frames when the supported RtpExtensions list is not available. 
    This makes the "early" frames arrive non-rotated, and the cached dimensions
    will be correct for the common case when the rotation extension is
supported.
    If the other side is an older endpoint which does not support rotation, 
    the encoder will have to be reconfigured - but it's better to penalize the 
    uncommon case rather than the common one.
==========

commit-bot: I haz the power

Description was changed from ========== Avoid unnecessary HW video encoder reconfiguration This change reduces the ...

4 years, 6 months ago (2016-06-16 19:08:15 UTC) #22

Message was sent while issue was closed.

Description was changed from

==========
Avoid unnecessary HW video encoder reconfiguration

This change reduces the number of times the Android hardware video
encoder is reconfigured when making an outgoing call. With this change, 
the encoder should only be initialized once as opposed to the ~3 times
it happens currently.

Before the fix, the following sequence of events caused the extra
reconfigurations:

 1. After the SetLocalDescription call, the WebRtcVideoSendStream is created.
    All frames from the camera are dropped until the corresponding 
    VideoSendStream is created.

 2. SetRemoteDescription() triggers the VideoSendStream creation. At
    this point, the encoder is configured for the first time, with the
    frame dimensions set to a low resolution default (176x144).

 3. When the first video frame is received from the camera after the
    VideoSendStreamIsCreated, the encoder is reconfigured to the correct
    dimensions. If we are using the Android hardware encoder, the default
    configuration is set to encode from a memory buffer (use_surface=false).

 4. When the frame is passed down to the encoder in
    androidmediaencoder_jni.cc EncodeOnCodecThread(), it may be stored in
    a texture instead of a memory buffer. In this case, yet another
    reconfiguration takes place to enable encoding from a texture.

 5. Even if the resolution and texture flag were known at the start of 
    the call, there would be a reconfiguration involved if the camera is
    rotated (such as when making a call from a phone in portrait orientation).
    The reason for that is that at construction time, WebRtcVideoEngine2 
    sets the VideoSinkWants structure parameter to request frames rotated
    by the source; the early frames will then arrive in portrait resolution. 
    When the remote description is finally set, if the rotation RTP extension 
    is supported by the remote receiver, the source is asked to provide
    non-rotated frames. The very next frame will then arrive in landscape 
    resolution with a non-zero rotation value to be applied by the receiver. 
    Since the encoder was configured with the last (portrait) frame size, 
    it's going to need to be reconfigured again.

The fix makes the following changes:

 1. WebRtcVideoSendStream::OnFrame() now caches the last seen frame
    dimensions, and whether the frame was stored in a texture.

 2. When the encoder is configured the first time
    (WebRtcVideoSendStream::SetCodec()) - the last seen frame dimensions
    are used instead of the default dimensions.

 3. A flag that indicates if encoding is to be done from a texture has
    been added to the webrtc::VideoStream and webrtc::VideoCodec structs,
    and it's been wired up to be passed down all the way to the JNI code in
    androidmediaencoder_jni.cc.

 4. MediaCodecVideoEncoder::InitEncode is now reading the is_surface
    flag from the VideoCodec structure instead of guessing the default as
    false. This way we end up with the correct encoder configuration the
    first time around.

 5. WebRtcVideoSendStream now takes an optimistic guess and requests non-
    rotated frames when the supported RtpExtensions list is not available. 
    This makes the "early" frames arrive non-rotated, and the cached dimensions
    will be correct for the common case when the rotation extension is
supported.
    If the other side is an older endpoint which does not support rotation, 
    the encoder will have to be reconfigured - but it's better to penalize the 
    uncommon case rather than the common one.
==========

to

==========
Avoid unnecessary HW video encoder reconfiguration

This change reduces the number of times the Android hardware video
encoder is reconfigured when making an outgoing call. With this change,
the encoder should only be initialized once as opposed to the ~3 times
it happens currently.

Before the fix, the following sequence of events caused the extra
reconfigurations:

 1. After the SetLocalDescription call, the WebRtcVideoSendStream is created.
    All frames from the camera are dropped until the corresponding
    VideoSendStream is created.

 2. SetRemoteDescription() triggers the VideoSendStream creation. At
    this point, the encoder is configured for the first time, with the
    frame dimensions set to a low resolution default (176x144).

 3. When the first video frame is received from the camera after the
    VideoSendStreamIsCreated, the encoder is reconfigured to the correct
    dimensions. If we are using the Android hardware encoder, the default
    configuration is set to encode from a memory buffer (use_surface=false).

 4. When the frame is passed down to the encoder in
    androidmediaencoder_jni.cc EncodeOnCodecThread(), it may be stored in
    a texture instead of a memory buffer. In this case, yet another
    reconfiguration takes place to enable encoding from a texture.

 5. Even if the resolution and texture flag were known at the start of
    the call, there would be a reconfiguration involved if the camera is
    rotated (such as when making a call from a phone in portrait orientation).
    The reason for that is that at construction time, WebRtcVideoEngine2
    sets the VideoSinkWants structure parameter to request frames rotated
    by the source; the early frames will then arrive in portrait resolution.
    When the remote description is finally set, if the rotation RTP extension
    is supported by the remote receiver, the source is asked to provide
    non-rotated frames. The very next frame will then arrive in landscape
    resolution with a non-zero rotation value to be applied by the receiver.
    Since the encoder was configured with the last (portrait) frame size,
    it's going to need to be reconfigured again.

The fix makes the following changes:

 1. WebRtcVideoSendStream::OnFrame() now caches the last seen frame
    dimensions, and whether the frame was stored in a texture.

 2. When the encoder is configured the first time
    (WebRtcVideoSendStream::SetCodec()) - the last seen frame dimensions
    are used instead of the default dimensions.

 3. A flag that indicates if encoding is to be done from a texture has
    been added to the webrtc::VideoStream and webrtc::VideoCodec structs,
    and it's been wired up to be passed down all the way to the JNI code in
    androidmediaencoder_jni.cc.

 4. MediaCodecVideoEncoder::InitEncode is now reading the is_surface
    flag from the VideoCodec structure instead of guessing the default as
    false. This way we end up with the correct encoder configuration the
    first time around.

 5. WebRtcVideoSendStream now takes an optimistic guess and requests non-
    rotated frames when the supported RtpExtensions list is not available.
    This makes the "early" frames arrive non-rotated, and the cached dimensions
    will be correct for the common case when the rotation extension is
supported.
    If the other side is an older endpoint which does not support rotation,
    the encoder will have to be reconfigured - but it's better to penalize the
    uncommon case rather than the common one.

Committed: https://crrev.com/3abb7644001d264c402184705950111d3fb8f181
Cr-Commit-Position: refs/heads/master@{#13173}
==========

commit-bot: I haz the power

4 years, 6 months ago (2016-06-16 19:08:19 UTC) #23

Message was sent while issue was closed.

Patchset 5 (id:??) landed as
https://crrev.com/3abb7644001d264c402184705950111d3fb8f181
Cr-Commit-Position: refs/heads/master@{#13173}

Issue 2067103002: Avoid unnecessary HW video encoder reconfiguration (Closed)

Description

Patch Set 1 #

Patch Set 2 : Optimistically guess rotation is supported #

Patch Set 3 : Code review feedback #

Patch Set 4 : More CR feedback; replaced Dimensions with VideoFrameInfo #

Patch Set 5 : ReconfigureEncoderIfNecessary -> ReconfigureEncoder() #

Messages