Suppose I have a bulleted list like this:
* list item 1
* list item 2 (a parent)
** list item 3 (a child of list item 2)
** list item 4 (a child of list item 2 as well)
*** list item 5 (a child of list item 4 and a grand-child of list item 2)
* list item 6
I’d like to parse that into a nested list or some other data structure which makes the parent-child relationship between elements explicit (rather than depending on their contents and relative position). For example, here’s a list of tuples containing an item and a list of its children (and so forth):
Edit: Hopefully, a more correct list example, where each element in the list is a tuple containing: a bullet’s text and, if applicable, a list of children (in the same form).
[('list item 1',),
('list item 2', [('list item 3',), ('list item 4', [('list item 5',)])]
('list item 6',)]
[('list item 1',),
('list item 2', [('list item 3',), ('list item 4', [('list item 5',)])]),
('list item 6',)]
I’ve attempted to do this with plain Python and some experimentation with Pyparsing, but I’m not making progress. I’m left with two major questions:
- What’s the strategy I need to employ to make this work? I know recursion is part of the solution, but I’m having a hard time making the connection between this and, say, a Fibonacci sequence.
- I’m certain I’m not the first person to have done this, but I don’t know the terminology of the problem to make fruitful searches for more information on this topic. What problems are related to this so that I can learn more about solving these kinds of problems in general?
I can’t parse your desired result — it seems to have more open parentheses than corresponding closed ones and I don’t understand the logic behind it.
To make a tree structure explicit, what about, e.g.:
The
showmethod of course exists just for the purpose of verifying that the part about parsing the strings and creating the tree has worked correctly. If you can specify more precisely exactly how it is that you want to transform your tree into nested tuples and lists, I’m sure it will be easy to add to classNodethe appropriate (and probably recursive) method — so, could you please give this missing specification…?Edit: since the OP has clarified now, it does, as predicted, become easy to satisfy the spec. Just add to
class Nodethe following method:and change the last three lines of the code (last one of
makenest, a blank one, and the module-level call tomakenest) to:(The parentheses after
printare redundant but innocuous in Python 2, required in Python 3, so I put them there just in case;-).With these tiny changes, my code runs as requested, now emitting