I have the grammar file alexa_scrape.tt : grammar AlexaScrape rule document category_listing* end rule


Asked: June 15, 2026 (2026-06-15T03:37:47+00:00)


I have the grammar file alexa_scrape.tt:

grammar AlexaScrape
  rule document
    category_listing*
  end
  rule category_listing
    category_line url_line*
  end
  rule category_line
    category "\n"
  end
  rule category
    ("/" [^/]+)+
  end
  rule url_line
    [0-9]+ ". " url "\n"
  end
  rule url
    [^\n]*
  end
end

I have a Ruby file that attempts to make use of it:

#!/usr/bin/env ruby -I .
require 'rubygems'
require 'polyglot'
require 'treetop'
require 'alexa_scrape.tt'

parser = AlexaScrapeParser.new
p( parser.parse("") || parser.failure_reason )
p( parser.parse("/x\n") || parser.failure_reason )

But I’m not getting the results I expected:

SyntaxNode offset=0, ""
"Expected one of /, \n at line 2, column 1 (byte 4) after /x\n"

It parses the empty string properly (as the trivial match for document, zero category_listings), but fails to parse "/x\n" (as the document containing a single category_listing that itself has zero url_lines).

What am I doing wrong?


1 Answer

Editorial Team · Answer 1 · 2026-06-15T03:37:48+00:00


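It looks like the character class in category is consuming the whitespace needed to match category_line: [^/]+ also matches "\n", so category swallows the newline and the "\n" in category_line has nothing left to match. Exclude whitespace from the class: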

  rule category
    ("/" [^/\s]+)+    # or perhaps ("/" [^/\n]+)+
  end
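
The effect can be reproduced without Treetop. The following is a minimal sketch in plain Ruby, assuming the character classes in the grammar behave like the equivalent Ruby regexp character classes; the constant names GREEDY and FIXED are made up for illustration. It shows how much of "/x\n" each version of category would consume and what is left over for the "\n" that category_line expects.

```ruby
# Plain-Ruby sketch; Treetop is not needed to see the effect.
# GREEDY and FIXED are hypothetical names mirroring the two versions
# of the `category` rule.
GREEDY = %r{\A(?:/[^/]+)+}    # original rule: [^/] also matches "\n"
FIXED  = %r{\A(?:/[^/\n]+)+}  # suggested rule: stops at the newline

input = "/x\n"

greedy_match = input[GREEDY]
fixed_match  = input[FIXED]

# Whatever follows the match is all that remains for the "\n"
# terminal in category_line.
greedy_rest = input[greedy_match.length..-1]
fixed_rest  = input[fixed_match.length..-1]

p greedy_match  # "/x\n"  (newline already consumed)
p greedy_rest   # ""      (nothing left, so category_line fails)
p fixed_match   # "/x"
p fixed_rest    # "\n"    (category_line can now match its newline)
```

Because repetition in a parsing expression grammar is greedy and does not give characters back, the original category rule never releases the newline once it has matched it, which is consistent with the failure being reported at byte 4, after the whole input.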

(And, wow, a Treetop question. This is number 47 in the history of SO and its 4 million total questions. One in 87,000 SO questions is tagged Treetop.)
