I’m trying to create a piece of code but cannot get it working. The simplest example I can think of is parsing some CSV file.
Suppose we have a CVS file, but the data is organized in some kind of hierarchy in it. Like this:
Section1;
;Section1.1
;Section1.2
;Section1.3
Section2;
;Section2.1
;Section2.2
;Section2.3
;Section2.4
etc.
I did this:
let input =
"a;
;a1
;a2
;a3
b;
;b1
;b2
;b3
;b4
;b5
c;
;c1"
let lines = input.Split('\n')
let data = lines |> Array.map (fun l -> l.Split(';'))
let sections =
data
|> Array.mapi (fun i l -> (i, l.[0]))
|> Array.filter (fun (i, s) -> s <> "")
and I got
val sections : (int * string) [] = [|(0, "a"); (4, "b"); (10, "c")|]
Now I’d like to create a list of line index ranges for each section, something like this:
[|(1, 3, "a"); (5, 9, "b"); (11, 11, "c")|]
with the first number being a starting line index of the subsection range and the second – the ending line index. How do I do that? I was thinking about using fold function, but couldn’t create anything.
As far as I know, there is no easy way to do this, but it is definitely a good way to practice functional programming skills. If you used some hierarchical representation of data (e.g. XML or JSON), the situation would be a lot easier, because you wouldn’t have to transform the data structure from linear (e.g. list/array) to hierarchical (in this case, a list of lists).
Anyway, a good way to approach the problem is to realize that you need to do some more general operation with the data – you need to group adjacent elements of the array, starting a new group when you find an line with a value in the first column.
I’ll start by adding a line number to the array and then convert it to list (which is usually easier to work with in F#):
Now, we can write a reusable function that groups adjacent elements of a list and starts a new group each time the specified predicate
freturnstrue:Now, implementing your specific algorithm is relatively easy. We first need to group the elements, starting a new group each time the first column has some value:
In the second step, we need to do some processing for each group. We take its first element (and pick the title of the group) and then find the minimal and maximal line number among the remaining elements:
Note that the pattern matching in the lambda function will give a warning, but we can safely ignore that because the group will always be non-empty.
Summary: This answer shows that if you want to write elegant code, you can implement your reusable higher order function (such as
adjacentGroups), because not everything is available in the F# core libraries. If you use functional lists, you can implement it using recursion (for arrays, you’d use imperative programming as in the answer by gradbot). Once you have a good set of reusable functions, most of the problems are easy :-).