I’m trying to add to a DataTable some information in Parallel but if the

Question

0

Asked: June 9, 20262026-06-09T12:54:59+00:00 2026-06-09T12:54:59+00:00

I’m trying to add to a DataTable some information in Parallel but if the

0

I’m trying to add to a DataTable some information in Parallel but if the the loop is to long it freezes or just takes a lot of time, more time then an usual for loop, this is my code for the Parallel.For loop:

Parallel.For(1, linii.Length, index =>
                 {
                     DataRow drRow = dtResult.NewRow();
                     alResult = CSVParser(linii[index], txtDelimiter, txtQualifier);

                     for (int i = 0; i < alResult.Count; i++)
                     {
                         drRow[i] = alResult[i];
                     }
                     dtResult.Rows.Add(drRow);
                 }
             );

What’s wrong? this Parallel.For loop takes much more time than a normal one, what is wrong?

Thanks!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-09T12:55:00+00:00

You can’t mutate a DataTable from 2 different threads; it will error. DataTable makes no attempt to be thread-safe. So: don’t do that. Just do this from one thread. Most likely you are limited by IO, so you should just do it on a single thread as a stream. It looks like you’re processing text data. You seem to have a string[] for lines, perhaps File.ReadAllLines() ? Well, that is very bad here:

it forces it all to load into memory
you have to wait for it all to load into memory
CSV is a multi-line format; it is not guaranteed that 1 line == 1 row

What you should do is use something like the CsvReader from code project, but even if you want to just use one line at a time, use a StreamReader:

using(var file = File.OpenText(path)) {
    string line;
    while((line = file.ReadLine()) != null) {
        // process this line
        alResult = CSVParser(line, txtDelimiter, txtQualifier);

        for (int i = 0; i < alResult.Count; i++)
        {
            drRow[i] = alResult[i];
        }
        dtResult.Rows.Add(drRow);
    }
}

This will not be faster using Parallel, so I have not attempted to do so. IO is your bottleneck here. Locking would be an option, but it isn’t going to help you massively.

As an unrelated aside, I notice that alResult is not declared inside the loop. That means that in your original code alResult is a captured variable that is shared between all the loop iterations – which means you are already overwriting each row horribly.

Edit: illustration of why Parallel is not relevant for reading 1,000,000 lines from a file:

Approach 1: use ReadAllLines to load the lines, then use Parallel to process them; this costs [fixed time] for the physical file IO, and then we parallelise. The CPU work is minimal, and we’ve basically spent [fixed time]. However, we’ve added a lot of threading overhead and memory overhead, and we couldn’t even start until all the file was loaded.

Approach 2: use a streaming API; read each one line by line – processing each line and adding it. The cost here is basically again: [fixed time] for the actual IO bandwidth to load the file. But; we now have no threading overhead, no sync conflicts, no huge memory to allocate, and we start filling the table right away.

Approach 3: If you really wanted, a third approach would be a reader/writer queue, with one dedicated thread processing file IO and enqueueing the lines, and a second that does the DataTable. Frankly, it is a lot more moving parts, and the second thread will spend 95% of its time waiting for data from the file; stick to Approach 2!

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m trying to add to a DataTable some information in Parallel but if the

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply