I need to read gigabytes of text so I’m trying to optimize my code.

Question

0

Asked: May 19, 20262026-05-19T15:07:34+00:00 2026-05-19T15:07:34+00:00

I need to read gigabytes of text so I’m trying to optimize my code.

0

I need to read gigabytes of text so I’m trying to optimize my code. When doing this I found that, for my problem, using a dictionary is faster than if-tests.

check = {'R':'-', 'F':'+'}
seqs = ['R', 'F']*100

def check1():
    for entry in seqs:
        if entry == 'R':
            strand = '-'
        if entry == 'F':
            strand = '+'

def check2():
    for entry in seqs:
        strand = check[entry]

Using ipythong’s %timeit I see that looking up in a dictionary is slightly more than twice as fast as using two if-tests:

In [63]: %timeit check1()
10000 loops, best of 3: 38.8 us per loop

In [64]: %timeit check2()
100000 loops, best of 3: 16.2 us per loop

Since if-tests are so basic, I did not expect a performance difference. Is this well known? Can anybody explain why this is so?

UPDATE

I checked how the two functions above as well as check3() below affect the runtime of my actual code, and there’s no effect on the total time. So either the boost gotten with the dictionary is not so high in a real-world example where the ‘R’ and ‘F’ values need to be re-read from file constantly, or this piece of code is just not part of my bottleneck.

Anyway thanks for the answers!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-19T15:07:35+00:00

Editorial Team

2026-05-19T15:07:35+00:00Added an answer on May 19, 2026 at 3:07 pm

As with a lot of VM code, it mostly comes down to the number of VM opcodes involved.

You can examine the assembled functions with dis:

import dis
dis.dis(func)

In 2.6.4, check1 takes around 15-20 opcodes (depending on the code path), for each comparison and branch. check2 takes just 7 (after adding the missing chedict dictionary, declared globally).

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I need to read gigabytes of text so I’m trying to optimize my code.

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply