The Archive Base

Editorial Team
Asked: June 1, 2026

I am trying to get all HTML links within a string and replace them, using preg_replace, with another link (for link tracking etc.).

It works fine on links like http://www.facebook.com but not on those that do not have a 'www' in the URL.
So the first URL below would be fine, but the second wouldn't work. Can anyone suggest how I can alter my expression so that BOTH kinds of link work?

http://www.twitter.com
Posts by myusername

$message = preg_replace("/<a([^>]+)href=\"http\:\/\/([a-zA-Z0-9\-]+\.[a-zA-Z0-9]+\.[a-zA-Z]{2,3}(\/*)?)/", "<a$1href=\"http://www.site.com/system/link_tracker.php?URL=$2&ID={$ID}\"", $message);

1 Answer

  1. Editorial Team
    Added an answer on June 1, 2026 at 5:39 am

    http\:\/\/([a-zA-Z0-9\-]+\.[a-zA-Z0-9]+\.[a-zA-Z]{2,3}(\/*)?)

    This is by no means a URL regex. It might work for two or three cases, but it ignores the existence of:

    • https,
    • multiple labels in the domain name (foo15.cdn.amazon.com),
    • dashes in the domain name (the pattern only allows them in the first label),
    • internationalized domain names,
    • TLDs that consist of something other than 2 or 3 letters (.info, .museum) or of multiple parts (.co.uk), and most importantly
    • deep links.

    Users will especially dislike the last of these, because when they deep-link to a page on some site, your regex renders the link invalid. Use a common approach instead, for example DOMDocument, to alter links in an HTML document (which I assume you have, since you're capturing URLs in <a> tags).
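To make the gaps listed above concrete, here is a small sketch that runs the pattern from the question against a few sample links and reports what the second capture group (the part passed on as URL=) would contain. The helper name `capturedUrl` and the sample hrefs are assumptions made for illustration; only the pattern itself comes from the question.

```php
<?php
// Hypothetical helper: returns what the question's pattern captures as the
// "URL" part ($2), or null when the pattern does not match the link at all.
function capturedUrl(string $html): ?string
{
    // Pattern copied from the question (single-quoted, so backslashes are kept).
    $pattern = '/<a([^>]+)href="http\:\/\/([a-zA-Z0-9\-]+\.[a-zA-Z0-9]+\.[a-zA-Z]{2,3}(\/*)?)/';

    return preg_match($pattern, $html, $m) === 1 ? $m[2] : null;
}

// Illustrative sample links (assumed, not from the original post).
$samples = [
    '<a href="http://www.facebook.com">',            // matches in full
    '<a href="http://twitter.com/myusername">',      // no third label: no match
    '<a href="https://www.facebook.com">',           // https: no match
    '<a href="http://www.my-site.com">',             // dash in second label: no match
    '<a href="http://www.facebook.com/some/page">',  // path is cut off
    '<a href="http://foo15.cdn.amazon.com">',        // host is cut off
];

foreach ($samples as $sample) {
    var_dump(capturedUrl($sample));
}
```

Because the pattern has no anchor after the TLD part, the last two samples still match, but only a prefix of the real URL is captured, so the rewritten link would point to the wrong place.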

    As said in this answer, that can be done with something like this, not tested:

    $dom = new DOMDocument();
    // Parse the HTML string ($message, as in the question); @ hides warnings about sloppy markup
    @$dom->loadHTML($message);
    $links = $dom->getElementsByTagName('a');

    foreach ($links as $link)
    {
        // Find original href
        $href = $link->getAttribute('href');

        // Skip anchors without a link target
        if ($href === '') {
            continue;
        }

        // Build the tracker URL; urlencode() keeps the full original URL, deep links included
        $href = "http://www.site.com/system/link_tracker.php?URL=" . urlencode($href) . "&ID={$ID}";

        // Replace href; the node is modified in place, so no replaceChild() call is needed
        $link->setAttribute('href', $href);
    }

    // Serialize the modified document back to a string
    $message = $dom->saveHTML();
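As a self-contained variant of the DOM approach, the sketch below wraps the steps in a function that takes an HTML fragment and returns the fragment with its links rewritten. The function name `addLinkTracking`, its parameters, and the rule of only rewriting http and https links are assumptions for illustration, not something the original post specifies. It also assumes the input is plain ASCII or that character encoding is handled elsewhere, since `loadHTML()` does not assume UTF-8 unless the markup declares it.

```php
<?php
// Hypothetical helper: routes every http(s) link in an HTML fragment through a tracker URL.
function addLinkTracking(string $html, string $trackerUrl, int $id): string
{
    $dom = new DOMDocument();

    // Collect parser warnings internally instead of emitting them.
    $previous = libxml_use_internal_errors(true);
    // Wrap the fragment so its nodes can be read back from <body> afterwards.
    $dom->loadHTML('<html><body>' . $html . '</body></html>');
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    foreach ($dom->getElementsByTagName('a') as $link) {
        $href = $link->getAttribute('href');

        // Assumption: only absolute http/https links are tracked;
        // mailto:, relative links, and anchors without href are left alone.
        $scheme = strtolower((string) parse_url($href, PHP_URL_SCHEME));
        if ($scheme !== 'http' && $scheme !== 'https') {
            continue;
        }

        // urlencode() preserves the complete original URL, including paths and queries.
        $link->setAttribute('href', $trackerUrl . '?URL=' . urlencode($href) . '&ID=' . $id);
    }

    // Return only the children of <body>, without the wrapper added above.
    $out = '';
    foreach ($dom->getElementsByTagName('body')->item(0)->childNodes as $child) {
        $out .= $dom->saveHTML($child);
    }

    return $out;
}

// Usage with the tracker URL from the question and an assumed ID of 7.
$message = '<p><a href="http://twitter.com/myusername">Posts by myusername</a></p>';
echo addLinkTracking($message, 'http://www.site.com/system/link_tracker.php', 7), "\n";
```

Note that the serializer writes the `&` in the attribute as `&amp;`, which is the correct HTML form; browsers decode it back to `&` when following the link.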
    
© 2021 The Archive Base. All Rights Reserved