webrtc/modules/audio_processing/vad/voice_activity_detector.h - Issue 1181933002: Pull the Voice Activity Detector out from the AGC

Side by Side Diff

Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.

Keyboard Shortcuts

	File
u :	up to issue
j / k :	jump to file after / before current file
J / K :	jump to next file with a comment after / before current file
	Side-by-side diff
i :	toggle intra-line diffs
e :	expand all comments
c :	collapse all comments
s :	toggle showing all comments
n / p :	next / previous diff chunk or comment
N / P :	next / previous comment
<Up> / <Down> :	next / previous line

	Issue
u :	up to list of issues
j / k :	jump to patch after / before current patch
o / <Enter> :	open current patch in side-by-side view
i :	open current patch in unified diff view

	Issue List
j / k :	jump to issue after / before current issue
o / <Enter> :	open current issue

Side by Side Diff: webrtc/modules/audio_processing/vad/voice_activity_detector.h

Issue 1181933002: Pull the Voice Activity Detector out from the AGC (Closed) Base URL: https://chromium.googlesource.com/external/webrtc.git@master

Patch Set: Changed VoiceActivityDetectorTest to use vector Created 5 years, 6 months ago

Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.

Jump to:

View unified diff | Download patch

« no previous file with comments | « webrtc/modules/audio_processing/vad/vad_circular_buffer_unittest.cc ('k') | webrtc/modules/audio_processing/vad/voice_activity_detector.cc » ('j') | webrtc/modules/audio_processing/vad/voice_activity_detector.cc » ('J')
Toggle Intra-line Diffs ('i') | Expand Comments ('e') | Collapse Comments ('c') | Hide Comments ('s')

OLD	NEW
(Empty)
	1 /*

	2 * Copyright (c) 2015 The WebRTC project authors. All Rights Reserved.

	3 *

	4 * Use of this source code is governed by a BSD-style license

	5 * that can be found in the LICENSE file in the root of the source

	6 * tree. An additional intellectual property rights grant can be found

	7 * in the file PATENTS. All contributing project authors may

	8 * be found in the AUTHORS file in the root of the source tree.

	9 */

	10

	11 #ifndef WEBRTC_MODULES_AUDIO_PROCESSING_VAD_VOICE_ACTIVITY_DETECTOR_H_

	12 #define WEBRTC_MODULES_AUDIO_PROCESSING_VAD_VOICE_ACTIVITY_DETECTOR_H_

	13

	14 #include <vector>

	15

	16 #include "webrtc/base/scoped_ptr.h"

	17 #include "webrtc/common_audio/resampler/include/resampler.h"

	18 #include "webrtc/modules/audio_processing/vad/vad_audio_proc.h"

	19 #include "webrtc/modules/audio_processing/vad/common.h"

	20 #include "webrtc/modules/audio_processing/vad/pitch_based_vad.h"

	21 #include "webrtc/modules/audio_processing/vad/standalone_vad.h"

	22

	23 namespace webrtc {

	24

	25 // A Voice Activity Detector (VAD) that combines the voice probability from the

	26 // StandaloneVad and PitchBasedVad to get a more robust estimation.

	27 class VoiceActivityDetector {

	28 public:

	29 VoiceActivityDetector();

	30

	31 // Processes each audio chunk and estimates the voice probability. The maximum

	32 // supported sample rate is 32kHz.

	33 // TODO(aluebs): Change \|length\| to size_t.

	34 void ProcessChunk(const int16_t* audio, int length, int sample_rate_hz);

	35

	36 // Returns a vector of voice probabilities for each chunk. It can be empty for

	37 // some chunks, but it catches up afterwards returning multiple values at

	38 // once.

	39 const std::vector<double>& chunkwise_voice_probabilities() const {

	40 return chunkwise_voice_probabilities_;

	41 }

	42

	43 // Returns a vector of RMS values for each chunk. It has the same length as

	44 // chunkwise_voice_probabilities().

	45 const std::vector<double>& chunkwise_rms() const { return chunkwise_rms_; }

	46

	47 // Returns the last voice probability, regardless of the internal

	48 // implementation, although it has a few chunks of delay.

	49 float last_voice_probability() const { return last_voice_probability_; }

	50

	51 private:

	52 // TODO(aluebs): Change these to float.

	53 std::vector<double> chunkwise_voice_probabilities_;

	54 std::vector<double> chunkwise_rms_;

	55

	56 float last_voice_probability_;

	57

	58 Resampler resampler_;

	59 VadAudioProc audio_processing_;

	60

	61 rtc::scoped_ptr<StandaloneVad> standalone_vad_;

	62 PitchBasedVad pitch_based_vad_;

	63

	64 int16_t resampled_[kLength10Ms];

	65 AudioFeatures features_;

	66 };

	67

	68 } // namespace webrtc

	69

	70 #endif // WEBRTC_MODULES_AUDIO_PROCESSING_VAD_VOICE_ACTIVITY_DETECTOR_H_

OLD	NEW