Talk:File size distribution
Clarification on logarithms
It does not change anything to compute logarithms: if sizes P and Q are different, then log(P) and log(Q) will by different as well, unless the values are truncated. The task should be clarified about this. Anyway, it's very easy to adapt the program, for instance in the Python code, either by computing an aggregated key before storing in the dictionary, either by computing everuthng in the end (but then the dictionary will grow larger if there are many different sizes).