tools/py_event_log_analyzer/misc.py - Issue 1999113002: New rtc dump analyzing tool in Python

Side by Side Diff: tools/py_event_log_analyzer/misc.py

Issue 1999113002: New rtc dump analyzing tool in Python (Closed) Base URL: https://chromium.googlesource.com/external/webrtc.git@master

Patch Set: pylint issues Created 4 years, 7 months ago

Use n/p to move between diff chunks; N/P to move between comments. Draft comments are only viewable by you.

Jump to:

View unified diff | Download patch

OLD	NEW
(Empty)
	1 # Copyright (c) 2016 The WebRTC project authors. All Rights Reserved.

	2 #

	3 # Use of this source code is governed by a BSD-style license

	4 # that can be found in the LICENSE file in the root of the source

	5 # tree. An additional intellectual property rights grant can be found

	6 # in the file PATENTS. All contributing project authors may

	7 # be found in the AUTHORS file in the root of the source tree.

	8

	9 """Utility functions for statistics calculations.
	ivoc 2016/05/24 11:50:17 I would reorder this into: Utility functions for c I would reorder this into: Utility functions for calculating statistics. aleloi 2016/05/24 15:54:40 Done. Show quoted text On 2016/05/24 11:50:17, ivoc wrote: > I would reorder this into: Utility functions for calculating statistics. Done.
	10 """

	11

	12

	13 import collections

	14

	15

	16 def count_reordered(seq_no_s):
	ivoc 2016/05/24 11:50:18 please rename seq_no_s to sequence_numbers please rename seq_no_s to sequence_numbers kwiberg-webrtc 2016/05/24 12:09:51 A short comment that describes what this function A short comment that describes what this function does? And all the other functions... aleloi 2016/05/24 15:54:40 Done. Show quoted text On 2016/05/24 11:50:18, ivoc wrote: > please rename seq_no_s to sequence_numbers Done. aleloi 2016/05/24 15:54:40 A little better now, I think Show quoted text On 2016/05/24 12:09:51, kwiberg-webrtc wrote: > A short comment that describes what this function does? And all the other > functions... A little better now, I think
	17 largest = seq_no_s[0]

	18 result = 0

	19 for seq_no in seq_no_s:

	20 if seq_no < largest:

	21 result += 1

	22 largest = max(largest, seq_no)
	kwiberg-webrtc 2016/05/24 12:09:51 Nit: "else: largest = seq_no" instead of this stat Nit: "else: largest = seq_no" instead of this statement. aleloi 2016/05/24 15:54:40 Done. Show quoted text On 2016/05/24 12:09:51, kwiberg-webrtc wrote: > Nit: "else: largest = seq_no" instead of this statement. Done.
	23 return result

	24

	25

	26 def ssrc_size_table(data_points):

	27 d = collections.defaultdict(int)

	28 for dt in data_points:

	29 d[dt.ssrc] += dt.size

	30 total = sum(d.values())
	kwiberg-webrtc 2016/05/24 12:09:51 float(sum(d.values())) here, so that you don't hav float(sum(d.values())) here, so that you don't have to call float() in each iteration of the loop. aleloi 2016/05/24 15:54:40 removed "float" and added "from __future__ import Show quoted text On 2016/05/24 12:09:51, kwiberg-webrtc wrote: > float(sum(d.values())) here, so that you don't have to call float() in each > iteration of the loop. removed "float" and added "from __future__ import division" on top instead
	31 for key in d:

	32 d[key] /= float(total)

	33 return d

	34

	35

	36 def percent_table(data):

	37 d = collections.defaultdict(int)

	38 for dt in data:

	39 d[dt] += 1
	ivoc 2016/05/24 11:50:17 These 3 lines can be replaced by: d = collections. These 3 lines can be replaced by: d = collections.Counter(data) aleloi 2016/05/24 15:54:40 Thank you! Show quoted text On 2016/05/24 11:50:17, ivoc wrote: > These 3 lines can be replaced by: d = collections.Counter(data) Thank you!
	40 total = sum(d.values())

	41 for key in d:

	42 d[key] /= float(total)

	43 return d

	44

	45

	46 def hists(data, n_bins):

	47 """Generates a histogram of `data`.

	48

	49 The sequence of numbers `data` is divided into `n_bins` many

	50 bins with even proportion of data points in each.
	ivoc 2016/05/24 11:50:18 Is there a difference between this and numpy.histo Is there a difference between this and numpy.histogram? kwiberg-webrtc 2016/05/24 12:09:50 Not sure I understand. The intervals will have dif Not sure I understand. The intervals will have different sizes, to make the proportion of data points in each bin as close to 1/n_bins as possible? aleloi 2016/05/24 15:54:40 I got about the same result with numpy.histogram. Show quoted text On 2016/05/24 11:50:18, ivoc wrote: > Is there a difference between this and numpy.histogram? I got about the same result with numpy.histogram. hists is deleted. Thank you for the pointer!
	51

	52 Args:

	53 data: a sequence of numbers

	54 n_bins: the number of histogram bins into which data is divided.

	55 Returns:

	56 A dict mapping from histogram bin bounds to proportions. Dictionary

	57 keys are pairs of numbers. If `d` is the return value, and `d`

	58 contains an interval (10, 15), d[(10, 15)] is the proportion of

	59 values in data inside this
	ivoc 2016/05/24 11:50:17 Please add a comment on the boundary values (inclu Please add a comment on the boundary values (included in the range or not?) kwiberg-webrtc 2016/05/24 12:09:50 Are the intervals open, closed, or something else? Are the intervals open, closed, or something else? aleloi 2016/05/24 15:54:40 not longer relevant Show quoted text On 2016/05/24 11:50:17, ivoc wrote: > Please add a comment on the boundary values (included in the range or not?) not longer relevant
	60 """

	61 cumulative = 0

	62 start = sorted(data)[0]-1

	63 result = {}

	64 for i in sorted(data):
	kwiberg-webrtc 2016/05/24 12:09:50 i is a bad name for a variable that's not an index i is a bad name for a variable that's not an index. Or is data really supposed to be a sequence of indexes? kwiberg-webrtc 2016/05/24 12:09:51 You sort data twice. You sort data twice. aleloi 2016/05/24 15:54:40 here too Show quoted text On 2016/05/24 12:09:50, kwiberg-webrtc wrote: > i is a bad name for a variable that's not an index. Or is data really supposed > to be a sequence of indexes? here too
	65 cumulative += data[i]
	kwiberg-webrtc 2016/05/24 12:09:51 You add data[i] here rather than 1, using the elem You add data[i] here rather than 1, using the elements of data as indexes into data. aleloi 2016/05/24 15:54:40 here too Show quoted text On 2016/05/24 12:09:51, kwiberg-webrtc wrote: > You add data[i] here rather than 1, using the elements of data as indexes into > data. here too
	66 if cumulative >= 1/float(n_bins):
	ivoc 2016/05/24 11:50:17 Is this correct? What if the list contains only ve Is this correct? What if the list contains only very large numbers? kwiberg-webrtc 2016/05/24 12:09:50 cumulative and n_bins are both integers, right, wi cumulative and n_bins are both integers, right, with n_bins > 0? If so, this condition is equivalent to cumulative > 0. aleloi 2016/05/24 15:54:40 here too Show quoted text On 2016/05/24 11:50:17, ivoc wrote: > Is this correct? What if the list contains only very large numbers? here too
	67 result[(start, i)] = cumulative

	68 start = i+1
	kwiberg-webrtc 2016/05/24 12:09:50 You surround binary operators with spaces almost b You surround binary operators with spaces almost but not quite everywhere. aleloi 2016/05/24 15:54:40 here too Show quoted text On 2016/05/24 12:09:50, kwiberg-webrtc wrote: > You surround binary operators with spaces almost but not quite everywhere. here too
	69 cumulative = 0

	70 if cumulative != 0:

	71 result[(start, len(data)-1)] = cumulative

	72 return result

	73

	74

	75 def unwrap(data, mod):

	76 """Unwraps `data` modulo `mod`.

	77

	78 If data contains consecutive elements with difference above `mod/2`,

	79 an integer multiple of `mod` is added to one of the elements.
	ivoc 2016/05/24 11:50:18 By reading the code below it looks like the 'integ By reading the code below it looks like the 'integer multiple' is at most one and it is always added to the latter of the consecutive elements. Perhaps that should be in the description? kwiberg-webrtc 2016/05/24 12:09:50 I think this is under-specified. What integer mult I think this is under-specified. What integer multiple, and to which element? aleloi 2016/05/24 15:54:40 unwrap should work like matlab's unwrap (http://se Show quoted text On 2016/05/24 12:09:50, kwiberg-webrtc wrote: > I think this is under-specified. What integer multiple, and to which element? unwrap should work like matlab's unwrap (http://se.mathworks.com/help/matlab/ref/unwrap.html), but for integers. I added more description, but it is still a little unclear.
	80

	81 """

	82 prev = data[0]

	83 for idxm1, curr in enumerate(data[1:]):

	84 idx = idxm1 + 1

	85 delta_pos = (curr-prev)%mod

	86 delta_neg = (curr-prev)%mod - mod

	87 delta = delta_pos if abs(delta_pos) < abs(delta_neg) else delta_neg

	88 curr = prev + delta

	89 data[idx] = curr

	90 prev = curr

	91 return data

OLD	NEW

« tools/py_event_log_analyzer/README ('K') | « tools/py_event_log_analyzer/README ('k') | tools/py_event_log_analyzer/pb_parse.py » ('j') | tools/py_event_log_analyzer/pb_parse.py » ('J')