Due to redbubble.com’s lack of an API, I’m using an ATOM feed to steal

Asked by Editorial Team on May 21, 2026 (2026-05-21T21:31:33+00:00)


Due to redbubble.com’s lack of an API, I’m using an ATOM feed to steal information about a user’s pictures.

This is what the XML looks like:

<entry>
  <id>ID</id>
  <published>Date Published</published>
  <updated>Date Updated</updated>
  <link type="text/html" rel="alternate" href="http://www.redbubble.com/link/to/post"/>
  <title>Title</title>
  <content type="html">
    Blah blah blah stuff about the image..
    &lt;a href="http://www.redbubble.com/products/configure/config-id"&gt;&lt;img src="http://ih1.redbubble.net/path-to-image" alt="" /&gt;
  </content>
  <author>
    <name>Author Name</name>
    <uri>http://www.redbubble.com/people/author-user-name</uri>
  </author>
  <link type="image/jpeg" rel="enclosure" href="http://ih0.redbubble.net/path-to-the-original-image"/>
  <category term="1"/>
  <category term="2"/>
</entry>

Using regex, how would I go about getting the href attribute of the link inside the content tag?

One thing we know for sure is that the URL will always have configure in its path, i.e. http://somesite.com/**configure**/id

So I just need to find the URL with configure in it and grab the whole thing.


1 Answer

Editorial Team · Answer 1 · 2026-05-21T21:31:34+00:00


Thanks for your awesome answers but my colleague solved it for me!

This is what I ended up using:

/http:\/\/([^"\/]*\/)*configure\/([^"]*)/

(Ruby regex by the way)
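
For anyone wanting to try the pattern, here is a minimal sketch of how it might be applied to the text of a content element. The helper names `configure_url` and `configure_id` and the `sample` string are made up for illustration (the sample is copied from the feed excerpt in the question), and the sketch assumes the URL is terminated by a literal double quote, as it is in that excerpt.

```ruby
# The pattern from the answer: "http://", any number of path segments,
# then "configure/", then everything up to the next double quote.
CONFIGURE_URL = /http:\/\/([^"\/]*\/)*configure\/([^"]*)/

# Hypothetical helper: returns the whole matched URL, or nil if none is found.
def configure_url(content)
  content[CONFIGURE_URL]
end

# Hypothetical helper: returns only the part after "configure/"
# (the second capture group), or nil if there is no match.
def configure_id(content)
  match = CONFIGURE_URL.match(content)
  match && match[2]
end

# Sample taken from the <content> element shown in the question.
sample = '&lt;a href="http://www.redbubble.com/products/configure/config-id"&gt;' \
         '&lt;img src="http://ih1.redbubble.net/path-to-image" alt="" /&gt;'

puts configure_url(sample) # prints http://www.redbubble.com/products/configure/config-id
puts configure_id(sample)  # prints config-id
```

Because `[^"\/]*` cannot cross a double quote, the image URL that follows in the same content is not swallowed into the match. If the feed ever escapes the quotes as well (for example as `&quot;`), the text would probably need unescaping before matching.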
