The following code truncates a number of type Double to one in the type

Question

0

Asked: June 18, 20262026-06-18T15:23:02+00:00 2026-06-18T15:23:02+00:00

The following code truncates a number of type Double to one in the type

0

The following code truncates a number of type Double to one in the type Word16 (although I suspect any other word type behaves similarly, I had to choose one for the example).

truncate1 :: Double -> Word16
truncate1 = fromIntegral . (truncate :: Double -> Int)

As you can read, I first truncate it to Int and only then I cast it to Word16. I benchmarked this function agains a direct truncation:

truncate2 :: Double -> Word16
truncate2 = truncate

Surprisingly to me, the first version (going thru the Int type first) performed much better. Or the second much worse. According to the criterion output:

benchmarking truncate/truncate1
mean: 25.42399 ns, lb -47.40484 ps, ub 67.87578 ns, ci 0.950
std dev: 145.5661 ns, lb 84.90195 ns, ub 244.2057 ns, ci 0.950
found 197 outliers among 100 samples (197.0%)
  97 (97.0%) low severe
  100 (100.0%) high severe
variance introduced by outliers: 99.000%
variance is severely inflated by outliers

benchmarking truncate/truncate2
mean: 781.0604 ns, lb 509.3264 ns, ub 1.086767 us, ci 0.950
std dev: 1.436660 us, lb 1.218997 us, ub 1.592479 us, ci 0.950
found 177 outliers among 100 samples (177.0%)
  77 (77.0%) low severe
  100 (100.0%) high severe
variance introduced by outliers: 98.995%
variance is severely inflated by outliers

To be honest, I just started using Criterion, so I’m not an expert using it, but I understand that 25.42399 ns is a shorter execution time than 781.0604 ns. I suspect that some specialization is playing a role here. Is it truncate2 too slow? Being the case, can truncate be improved? Furthermore, anybody knows an even faster way? I feel like doing something wrong casting to a type I don’t really use.

Thanks in advance.

I am compiling with GHC-7.4.2, optimizations enabled (-O2).

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-18T15:23:03+00:00

First, note that the module GHC.Word includes the following RULE pragma:

"truncate/Double->Word16"
    forall x. truncate (x :: Double) = (fromIntegral :: Int -> Word16) (truncate x)

This is a simple rewrite rule to perform precisely the optimization your truncate1 provides. So we have a few questions to consider:

Why is this an optimization at all?

Because the default implementation of truncate is generic, to support any Integral instance. The speed difference you see is the cost of that generality; in the specific case of truncating one primitive type to another, there are much faster methods available.

So it seems that truncate1 is benefiting from a specialized form, while truncate2 is not.

Why is `truncate1` faster?

In GHC.Float, where the RealFrac instance for Double is defined, we have the following RULE pragma:

"truncate/Double->Int"              truncate = double2Int

Where double2Int is the optimized form we want. Compare this to the RULE mentioned earlier–apparently there’s no similar primitive operation specifically for converting Double to Word16.

Why doesn’t `truncate2` get rewritten as well?

Quoth the GHC User’s Guide:

GHC currently uses a very simple, syntactic, matching algorithm for matching a rule LHS with an expression. It seeks a substitution which makes the LHS and expression syntactically equal modulo alpha conversion. The pattern (rule), but not the expression, is eta-expanded if necessary.

Expressions being matched are not eta-expanded, which is to say that a rule matching on forall x. foo x will match in bar y = foo y but not in bar = foo.

Since your definitions are all written point-free, the RULE for Double -> Int matches, but the RULE for Double -> Word16 does not.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

The following code truncates a number of type Double to one in the type

Leave an answerCancel reply

1 Answer

Why is this an optimization at all?

Why is truncate1 faster?

Why doesn’t truncate2 get rewritten as well?

Leave an answer
Cancel reply

Why is `truncate1` faster?

Why doesn’t `truncate2` get rewritten as well?