I’ve been working on a program, but due to Mac OS X’s difficulties in

Question

Asked: May 23, 2026 (2026-05-23T18:06:35+00:00)

I’ve been working on a program, but because of Mac OS X’s difficulties in updating Python, I’ve been writing it in both 3.2 and 2.6. Both versions of the script fail with an error, though a different one in each case. Here’s the script:

This is the 3.2 version:

import sys
import os 
import re 
import urllib 
import urllib.request

## opens the URL as a bytes object
urlfilebytes = urllib.request.urlopen('http://www.reddit.com/r/fffffffuuuuuuuuuuuu')
## decodes the bytes object into a string
urlfile = urlfilebytes.read().decode('utf-8')
## saves list of matches for pattern
matches = re.findall(r'[http://imgur.com/][\s]+"', open(urlfile).read())

This returns the error:
TypeError: invalid file:

The 2.6 version on the other hand:

import sys
import os
import re
import urllib
urlfilebytes = urllib.urlopen('http://www.reddit.com/r/fffffffuuuuuuuuuuuu')
urlfile = urlfilebytes.read().decode('utf-8')
matches = re.findall(r'[http://imgur.com/][\s]+"', open(urlfile).read())

This returns the error:

IOError: [Errno 63] File name too long: u'<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"><html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en" ><head><title>FFFFFFFUUUUUUUUUUUU-</title><meta name="keywords" content=" r **ETC ETC ETC**

I’m kind of stumped here, can anyone help me out?

1 Answer

Editorial Team · Answer 1 · 2026-05-23T18:06:35+00:00


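urlfile already holds the page’s HTML as a string, so open(urlfile) tries to use the whole page as a file name, which is why Python 2.6 reports “File name too long”. Are you sure you don’t want to just do this?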

re.findall(r'[http://imgur.com/][\s]+"', urlfile)

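And I bet the regexp doesn’t do what you think it does: square brackets define a character class, so [http://imgur.com/] matches a single character from that set rather than the literal URL, and [\s]+ matches whitespace rather than the rest of the link. Perhaps you need to ask another question about that.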
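
A quick way to check what the original pattern matches is to run it on a small string. The sketch below uses a made-up sample (an assumption, not the real page content) and the variable names are only illustrative:

```python
import re

# Hypothetical sample text standing in for the downloaded page.
sample = '<a href="http://imgur.com/abc123">pic "two"</a>'

original = r'[http://imgur.com/][\s]+"'

# [http://imgur.com/] matches ONE character out of h, t, p, :, /, i, m, g, u, r, ., c, o.
# [\s]+ then requires one or more whitespace characters, followed by a double quote.
original_matches = re.findall(original, sample)
print(original_matches)  # -> ['c "']
```

The only hit is the "c" at the end of "pic", followed by a space and a quote; the imgur link itself is never matched.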

Perhaps something like this, which returns the full URL of each link:

re.findall(r'(http://imgur.com/\S+)"', urlfile)

or this, which returns only the part after the domain:

re.findall(r'http://imgur.com/(\S+)"', urlfile)
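
Putting the pieces together for Python 3, a minimal sketch might look like the following. The helper names (find_imgur_links, fetch_html) and the sample HTML are hypothetical, and the pattern is a slightly stricter variant of the ones above rather than the exact same regexp: the dot is escaped, and [^"\s]+ is used so the match stops at the first closing quote.

```python
import re
import urllib.request

# Captures an imgur URL up to (not including) the closing double quote.
# The escaped dot matches only a literal "."; [^"\s]+ stops at the first
# double quote or whitespace character.
IMGUR_LINK = re.compile(r'(http://imgur\.com/[^"\s]+)"')


def find_imgur_links(html):
    """Return every imgur URL found in an HTML string."""
    return IMGUR_LINK.findall(html)


def fetch_html(url):
    """Download a page and decode it as UTF-8 text (assumes the page is UTF-8)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode('utf-8')


# Hypothetical sample HTML so the example runs without network access.
sample_html = ('<a href="http://imgur.com/abc123">one</a> '
               '<a href="http://imgur.com/xyz.png">two</a>')
links = find_imgur_links(sample_html)
print(links)  # -> ['http://imgur.com/abc123', 'http://imgur.com/xyz.png']
```

With a real page, find_imgur_links(fetch_html(url)) would take the place of the sample string. The search runs directly on the decoded text, so open() is never involved. The stricter character class is a judgment call: \S+ is greedy and can run past the first closing quote when no whitespace follows it.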
