I have scraped a webpage using Scrapy and need to extract the background color

Question

Asked: May 27, 2026 (2026-05-27T08:32:29+00:00)

I have scraped a webpage using Scrapy and need to extract the background color from certain objects. Because inline CSS is not part of the DOM, or so I have read, I need a regex to augment my current XPath and select the needed value within an object’s style attribute. My current XPath returns the entire style value, like so:

background:#80FF00;height:48px;width:98px;color:#FFFFFF

I need a regex that will select the background hex value only (i.e. #80FF00). I do not need to verify that the value is properly formatted (e.g. with ([0-9A-Fa-f]{3}|[0-9A-Fa-f]{6})\b); I just need to grab whatever is between ‘background:’ and the following ‘;’.

I am new to writing regular expressions and appreciate the help.

1 Answer

Editorial Team · Answer 1 · 2026-05-27T08:32:30+00:00

The following regex should do what you want; the text you want to grab will be in the first capture group:

background:(.*?);

In Python, using the re module:

background = re.search(r'background:(.*?);', some_string).group(1)

The . matches any character (except a newline, by default), * means repeat the previous element any number of times, and the ? makes the match lazy, so it matches as few characters as possible. Laziness is necessary so that the capture stops at the first semicolon; a greedy .* would run past every semicolon and stop only at the last one. An alternative is background:([^;]*), since [^;] matches only non-semicolon characters.
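As a minimal sketch of the [^;]* alternative, the helper below wraps the search so that a style string with no background declaration returns None instead of raising an error. The name extract_background is hypothetical and used only for illustration, as is the choice to strip surrounding whitespace.

```python
import re

# Pattern from the answer: capture everything after 'background:'
# up to (but not including) the next semicolon.
BACKGROUND_RE = re.compile(r'background:([^;]*)')


def extract_background(style):
    """Return the value of the 'background' declaration, or None if absent.

    Hypothetical helper for illustration; 'style' is the raw text of a
    style attribute, e.g. the string returned by the XPath in the question.
    """
    match = BACKGROUND_RE.search(style)
    if match is None:
        return None
    # Strip whitespace so 'background: #80FF00' also yields '#80FF00'.
    return match.group(1).strip()


if __name__ == '__main__':
    style = 'background:#80FF00;height:48px;width:98px;color:#FFFFFF'
    print(extract_background(style))
```

Because [^;]* does not require a closing semicolon, this variant also works when background is the last declaration in the attribute, whereas background:(.*?); finds no match there and calling .group(1) on the None result raises AttributeError. Scrapy selectors also provide re() and re_first() methods that take a pattern like this directly, so the same regex can likely be applied to the XPath result without a separate re.search call.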
