I was trying to understand how .{n} and ?<option>: works in Regexp on Ruby

Question

0

Asked: June 17, 20262026-06-17T10:48:22+00:00 2026-06-17T10:48:22+00:00

I was trying to understand how .{n} and ?<option>: works in Regexp on Ruby

0

I was trying to understand how .{n} and ?<option>: works in Regexp on Ruby 1.9.3 environment. But couldn’t understand how the below code produce the output:

irb(main):001:0> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.{2}(?m:.)\Z/
=> ["fin\n", "fin\r\n", "find"]
irb(main):002:0> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.{1}(?m:.)\Z/
=> ["fin\n", "fi\n\n"]
irb(main):003:0> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.{1}(?m:.)\Z/
=> []
irb(main):010:0> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.(?m:.)\Z/
=> ["fin\n", "fi\n\n"]
irb(main):011:0> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.(m:.)\Z/
=> []
irb(main):012:0> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.(?m:.)\z/
=> []

Can anyone help me to understand how the above code worked to generate the mentioned output in IRB terminal?

Thanks,

As per @Kevin last paragraph I tried below and found expected and desirable output :

irb(main):014:0> %W{fin fi\n\n \n\n fin\r\n find}.grep /f.(?m:.)\z/
=> ["fin"]
irb(main):015:0> %W{fin fi\n\n \n\n fin\r find}.grep /f.(?m:.)\z/
=> ["fin"]
irb(main):016:0> %W{fin fi\n \n\n fin\r\n find}.grep /f.(?m:.)\z/
=> ["fin", "fi\n"]
irb(main):017:0> %W{fin fi\n \n\n fr\n find}.grep /f.(?m:.)\z/
=> ["fin", "fi\n", "fr\n"]
irb(main):018:0>

Thank you very much @Kevin . You helped me to understand the whole concept!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-17T10:48:23+00:00

{n} means “repeat the previous atom n times”. In regular expressions, an atom is a self-contained unit. So a single character is an atom. So is a dot. A group is an atom as well (that contains other atoms), as is a character class. So .{n} means “match n characters” (because . means “match any character”).

Note that {n} is not like a backreference, in that it doesn’t have to match the same text on each repetition. .{5} behaves exactly like ......

This construct is also more powerful. It can take two numbers, and it matches a repetition count for that whole range. So .{3,5} means “match 3 to 5 characters”. And .{3,} means “match 3 or more characters”. ? can be replaced with {0,1}, * with {0,}, and + with {1,} if you so desired.

?<option: isn’t actually a thing. It’s (?<option>:<pattern>), and this turns on all the flags listed in <option> for the duration of <pattern>. It’s like a group, except it doesn’t actually create a back reference. So the expression (?m:.) means “match one character as if the flag m was turned on”. Given the behavior of m as “match \n” as nhahtdh said in the comments, the expression .(?m:.). means “match any character besides newline, followed by any character, followed by any character besides newline”.

This construct has two benefits. First, it allows you to only have a flag apply to part of a pattern, which can be occasionally useful. And second, if you wrap your entire pattern in this construct, then you have control over the flags that apply to your regular expression regardless of where the expression is used. This is useful when you’re providing the regex as a user and don’t have control over the source of the program.

Let’s take a look at the examples you gave:

> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.{2}(?m:.)\Z/
=> ["fin\n", "fin\r\n", "find"]

Your pattern /f.{2}(?m:.)\Z/ means “match f, followed by 2 of any character (but newline), followed by any character, and anchor to the end of the string or just before a newline”.

So in each of the 3 matches, fin matches the f.{2}. (?m:.) matches \n in the first, \r in the second, and d in the third. And \Z matches the end of the string in the first, just before a newline in the second, and the end of the string in the third.

fi\n\n doesn’t match because the first \n here can’t be matched by the . from .{2} without the m flag.

> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.{1}(?m:.)\Z/
=> ["fin\n", "fi\n\n"]

Here fi matches f.{1} in both cases. (?m:.) matches n and \n, and \Z matches before the newline in both cases.

fin\r\n doesn’t match because \Z will only match before the final newline in the string, not before a CRLF pair. And find doesn’t match because there’s nothing to match the d.

> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.{1}(?m:.)\Z/
=> []

I think you have a copy & paste error here. This is identical to the previous pattern and matches as that does.

> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.(?m:.)\Z/
=> ["fin\n", "fi\n\n"]

This is also identical to the previous pattern. . and .{1} are the same thing. In fact, {1} can always be stripped from any regular expression without changing anything.

> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.(m:.)\Z/
=> []

You dropped the ? in this pattern, changing the meaning of (m:.). This no longer changes options. Now it’s just a capturing group that matches the pattern m:., which of course doesn’t occur in your input.

> %W{fin\n fi\n\n \n\n fin\r\n find}.grep /f.(?m:.)\z/
=> []

You changed \Z to \z here. The difference between those two is \Z may match before a trailing newline, but \z must only match the end of the string. Without being able to match before the trailing newline, none of your inputs here match. But, for example, if you had fin (without the newline) or fi\n (without the second newline) it would work.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I was trying to understand how .{n} and ?<option>: works in Regexp on Ruby

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply