I’m curious how I should go about improving the performance of a Haskell routine

Question

0

Asked: May 28, 20262026-05-28T04:07:00+00:00 2026-05-28T04:07:00+00:00

I’m curious how I should go about improving the performance of a Haskell routine

0

I’m curious how I should go about improving the performance of a Haskell routine that finds the lexicographically minimal cyclic rotation of a string.

import Data.List
swapAt n = f . splitAt n where f (a,b) = b++a
minimumrotation x = minimum $ map (\i -> swapAt i x) $ elemIndices (minimum x) x

I’d imagine that I should use Data.Vector rather than lists because Data.Vector provides in-place operations, probably just manipulating some indices into the original data. I shouldn’t actually need to bother tracking the indices myself to avoid excess copying, right?

I’m curious how the ++ impact the optimization though. I’d imagine it produces a lazy string thunk that never does the appending until the string gets read that far. Ergo, the a should never actually be appended onto the b whenever minimum can eliminate that string early, like because it begins with some very later letter. Is this correct?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-28T04:07:01+00:00

xs ++ ys adds some overhead in all the list cells from xs, but once it reaches the end of xs it’s free — it just returns ys.

Looking at the definition of (++) helps to see why:

[] ++ ys = ys
(x:xs) ++ ys = x : (xs ++ ys)

i.e., it has to “re-build” the entire first list as the result is traversed. This article is very helpful for understanding how to reason about lazy code in this way.

The key thing to realise is that appending isn’t done all at once; a new linked list is incrementally built by first walking through all of xs, and then putting ys where the [] would go.

So, you don’t have to worry about reaching the end of b and suddenly incurring the one-time cost of “appending” a to it; the cost is spread out over all the elements of b.

Vectors are a different matter entirely; they’re strict in their structure, so even examining just the first element of xs V.++ ys incurs the entire overhead of allocating a new vector and copying xs and ys to it — just like in a strict language. The same applies to mutable vectors (except that the cost is incurred when you perform the operation, rather than when you force the resulting vector), although I think you’d have to write your own append operation with those anyway. You could represent a bunch of appended (immutable) vectors as [Vector a] or similar if this is a problem for you, but that just moves the overhead to when you flattening it back into a single Vector, and it sounds like you’re more interested in mutable vectors.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m curious how I should go about improving the performance of a Haskell routine

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply