I’m having a problem trying to ignore whitespace in-between certain characters. I’ve been Googling around for a few days and can’t seem to find the right solution.
Here’s my code:
// Get Image data
preg_match('#<a href="(.*?)" title="(.*?)"><img alt="(.*?)" src="(.*?)"[\s*]width="150"[\s*]height="(.*?)"></a>#', $data, $imagematch);
$image = $imagematch[4];
Basically these are some of the scenarios I have:
<a href="/wiki/File:Sky1.png" title="File:Sky1.png"><img alt="Sky1.png" src="http://media-mcw.cursecdn.com/thumb/5/56/Sky1.png/150px-Sky1.png"width="150" height="84"></a>
(Notice the lack of a space between width="" and src="")
And
<a href="/wiki/File:TallGrass.gif" title="File:TallGrass.gif"><img alt="TallGrass.gif" src="http://media-mcw.cursecdn.com/3/34/TallGrass.gif" width="150"height="150"></a>
(Notice the lack of a space in between width="" and height="".)
Is there anyway to ignore the whitespace in between those characters? As I am not a Regex expert.
Add a
\s?if a space can be allowed.\s stands for white space
? says the preceding character may occur once or not occur.
If more than one spaces are allowed and is optional, use
\s*.* says preceding character can occur zero or more times.
allows an optional space between attribute name and =.
If you want an optional space after the
=also, add a\s?after it also.Likewise, wherever you have optional characters, you can use
?if the maximum occurrence is 1 or*if the maximum occurrence is unlimited, following the optional character.And your actual problem was
[\s*]which causes occurrence of a whitespace or a * as characters enclosed in[and]is a character class. A character class allows occurrence of any of its members once (so remove*from it) and if you append a quantifier (?,+,*etc) after the]any character(s) in the character class can occur according to the quantifier.