<p>I've done the measuring in the past more comments: <br>
1) D.get() involves 2 dict lookups. The first one is getting the "get" attribute. And then it has function call overhead<br>
2) If x in d: return d[x] ... Is 2 lookups on success and 1 on failure, it is also a branch prediction<br>
3) try/except is 1 lookup on success and an except overhead on miss<br>
4) don't forget d.setdefault(x, N)<br>
Exceptions are much more expensive than a dict lookup. The hit rate matters a lot. Often you'll have a 99+% hit rate, so saving the extra lookups is worthwhile.<br>
If you have modest hit rate in a loop, you can cache the attribute lookup.<br>
D_get = D.get<br>
for x in ...:<br>
Y = D_get(x, ...)</p>
<p>I've done it all. But honestly, if a bit of code is that performance sensitive, you should write it in pyrex before you make the python code ugly.</p>
<p>Absolutely measure. In fact, measure with real data. Timeit is nice, lsprof is nice, but all just hints. (d.get triggers an lsprof trace, but x in d and d[x] is evaluated in the interpreter, w/o triggering trace, etc)</p>
<p>The rough guidelines I've found:<br>
1) use pyrex if it really matters<br>
2) use try/d[x]/except if you know that the keys are almost always there (eg counting types)<br>
3) use if x in d: d[x] if hit rate is low<br>
4) avoid extra attribute lookups, especially in list comprehensions<br>
John<br>
=:-></p>
<div class="gmail_quote">On Jun 28, 2011 12:25 PM, "Marius Kruger" <<a href="mailto:amanic@gmail.com">amanic@gmail.com</a>> wrote:<br type="attribution"></div>