Driving myself nuts. I am trying to get just the domain name (http://www.example.com) out

Question


Asked: June 7, 2026


Driving myself nuts. I am trying to get just the domain name (http://www.example.com) out of access.log. What the log looks like:

tail access.log 

Fri, 13 Jul 2012 20:32:03 -0700,INFO,6fgmd8fk,params,http://www.example.com/images/CIV-260.jpg|

I have tried many variations of this one-liner (with sed and awk):

tail -4 access.log |grep http |awk {'print $6'} |cut -c28- |awk '$1>".com"' |sort |uniq

http://www.example.com/2713-7807.jpg|
http://www.example.com/2713-7808.jpg|
http://barfoo.com/img/14616_20120711182527.jpg|
http://foobar.com/css/14616_20120713142151.css|

I am stuck.


1 Answer

Editorial Team · Answer 1 · 2026-06-07T14:29:52+00:00


Using GNU grep (-P enables the Perl-style lookbehind, -o prints only the matched text):

grep -Po '(?<=http://)[^/]+' access.log | sort -u

If you want to keep the http:// prefix as part of the result:

grep -Po 'http://[^/]+' access.log | sort -u
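
Where grep -P is unavailable (for example with BSD or macOS grep), extended regular expressions may give a similar result. This is a minimal sketch, assuming your grep supports -o (GNU and BSD grep do, although it is not in POSIX). The function name extract_domains is hypothetical, and the character class that stops at /, |, a comma, or whitespace is an assumption based on the sample log line in the question. Unlike the commands above, it also matches https://.

```shell
# extract_domains FILE (hypothetical helper name)
# Prints each unique scheme://host found in FILE, one per line.
extract_domains() {
    # -E: extended regular expressions; -o: print only the matched text.
    # The match stops at the first "/", "|", "," or whitespace after the
    # scheme (an assumption about where the host ends in this log format).
    grep -Eo 'https?://[^/|,[:space:]]+' "$1" | sort -u
}
```

It could be called as extract_domains access.log. If only the last few lines matter, the same grep can be fed from tail instead of reading the whole file.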
