I want to download about 200 different HTML files over HTTPS and extract the title of the page from each file and put the titles into a text document.
How would I go about using Perl to download files using HTTPS? I searched Google, but I didn’t find very much helpful information or examples.
A good place to look for information on the downloading part is the libwww-perl cookbook.
Here’s some rudimentary sample code. It isn’t necessarily the best way, but it should work, assuming you have the LWP module installed (available from CPAN).
You might want to add more bells and whistles, such as unescaping HTML entities, handling error conditions, fetching URLs in parallel with multiple threads, faking the User-Agent string as Mozilla, etc. 🙂
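The original sample code did not survive in this copy, so here is a minimal sketch of what such a script might look like. The function name `extract_title` and the naive regex are my assumptions, not the original poster's exact code; note that HTTPS support in LWP also requires the LWP::Protocol::https module from CPAN.

#!/usr/bin/perl
# titlegrab.pl -- minimal sketch (names and regex are assumptions).
# Reads one URL per line from STDIN, fetches each page, prints its title.
use strict;
use warnings;
use LWP::UserAgent;   # HTTPS also needs LWP::Protocol::https installed

# Naive title extraction; robust code would use HTML::Parser instead.
sub extract_title {
    my ($html) = @_;
    return ($html =~ m{<title[^>]*>\s*(.*?)\s*</title>}is) ? $1 : '(no title)';
}

my $ua = LWP::UserAgent->new(timeout => 10);

while (my $url = <STDIN>) {
    chomp $url;
    next unless $url;
    my $res = $ua->get($url);
    if ($res->is_success) {
        print extract_title($res->decoded_content), "\n";
    }
    else {
        warn "$url: ", $res->status_line, "\n";
    }
}

Regex-based HTML parsing is fragile, but for grabbing a single `<title>` tag from a couple hundred pages it is usually good enough.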
If you saved this as titlegrab.pl, and you had a list of sites in sites.list (one URL per line), you could use this with
$ cat sites.list | perl titlegrab.pl
to see all the titles. Or redirect the output to a file, e.g.
$ cat sites.list | perl titlegrab.pl > results.txt