The RLE (run length encoding) pattern seems to come up a lot in my

Asked: June 7, 2026 (2026-06-07T03:02:57+00:00)

The RLE (run length encoding) pattern seems to come up a lot in my work.

The essence of it is that you output a reduction of the elements encountered since the last ‘break’, each time you see a ‘break’ or reach the end of the input.

(In actual RLE, the ‘break’ is simply the current character not matching the last one. In the real world it is usually a little more complex, but still a function of the current and last elements.)
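
To make the generalised pattern concrete, here is a minimal sketch in which the ‘break’ test and the reduction are passed in as functions. The names reduce_runs, is_break, and reduce_run are hypothetical, chosen only for illustration. Note that the flush step still appears twice, once inside the loop and once after it:

```python
def reduce_runs(items, is_break, reduce_run):
    """Return reduce_run(run) for each maximal run with no break inside it.

    is_break(prev, cur) decides whether a new run starts at cur.
    (Hypothetical helper, for illustration only.)
    """
    out = []
    run = []
    for cur in items:
        if run and is_break(run[-1], cur):
            out.append(reduce_run(run))  # flush inside the loop
            run = []
        run.append(cur)
    if run:
        out.append(reduce_run(run))      # the same flush, again, at end of input
    return out


# Plain RLE: a break is any change of value; the reduction is (value, length).
print(reduce_runs("abbbccac", lambda a, b: a != b, lambda r: (r[0], len(r))))
# [('a', 1), ('b', 3), ('c', 2), ('a', 1), ('c', 1)]

# A slightly richer break: start a new run when the gap exceeds 1, then sum each run.
print(reduce_runs([1, 2, 3, 7, 8, 20], lambda a, b: b - a > 1, sum))
# [6, 15, 20]
```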

I want to remove the duplicated condition and action, if last_val != None: rle.append((last_val, count)), which occur both inside the loop and again after it.

The constraints are:

replacing them with function calls results in more code, not less;
the solution should stay in imperative style (in Haskell, for example, the problem just evaporates).

The imperative Python code is:

#!/usr/bin/env python

data = "abbbccac"

if __name__ == '__main__':
  rle = []
  last_val = None
  count = 0

  for val in data:
    if val != last_val and last_val != None:
      rle.append((last_val, count))  # flush the finished run
      count = 1
    else:
      count += 1
    last_val = val
  if last_val != None:
    rle.append((last_val, count))  # same flush, repeated at end of input

  print(rle)

P.S. Trivially solvable in functional languages:

#!/usr/bin/env runhaskell
import Data.List (group)

dat = "abbbccac"

rle :: Eq a => [a] -> [(a, Int)]
rle arr = map (\g -> (head g, length g)) $ group arr

main :: IO ()
main = print $ rle dat

1 Answer

Editorial Team · Answer 1 · 2026-06-07T03:02:59+00:00

Here is a solution that stays imperative. You can eliminate the duplicated code by chaining a throwaway sentinel onto the end of the input. The sentinel never matches any of your list elements, so it forces one final end-of-sequence pass through your “this-not-equal-last” code:

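from itertools import chain

def rle(seq):
    ret = []
    # A fresh object() compares unequal to any ordinary element.
    sentinel = object()
    # Chaining the sentinel on guarantees a final "break" after the last run.
    enum = enumerate(chain(seq, [sentinel]))
    start, last = next(enum)
    for i, c in enum:
        if c != last:
            # Run length is the distance between consecutive break positions.
            ret.append((last, i - start))
            start, last = i, c
    return ret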

This even gracefully handles the case where the input seq is empty, and the input can be any sequence, iterator, or generator, not just a string.
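
A quick check of those claims, as a sketch that repeats the function above so that it runs standalone (Python 3 is assumed; the generator input is just an illustrative choice):

```python
from itertools import chain

def rle(seq):
    ret = []
    sentinel = object()
    enum = enumerate(chain(seq, [sentinel]))
    start, last = next(enum)
    for i, c in enum:
        if c != last:
            ret.append((last, i - start))
            start, last = i, c
    return ret

print(rle("abbbccac"))                 # [('a', 1), ('b', 3), ('c', 2), ('a', 1), ('c', 1)]
print(rle(""))                         # []  (only the sentinel is seen, so the loop never runs)
print(rle(x // 3 for x in range(7)))   # [(0, 3), (1, 3), (2, 1)]
```

The empty case works because next(enum) consumes the sentinel itself, leaving nothing for the loop to iterate over.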
