Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 881055
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 15, 20262026-05-15T12:11:58+00:00 2026-05-15T12:11:58+00:00

I want to find most often seen string in a huge log file. Can

  • 0

I want to find most often seen string in a huge log file. Can someone help me how to do this. one way to do this is to hash each and every string and count the maximum value but its not efficient. Are there any better ways to do this.

Thanks & Regards,

Mousey.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-15T12:11:59+00:00Added an answer on May 15, 2026 at 12:11 pm

    If performance is critical you may want to look at a trie or a Radix tree.


    If you’re just interested to know if one of the strings appears more than 50% of the times (let’s call that string the majority string) you can do something like this (see if I can get this right):

    1. get the first string and assume it’s the majority string and set it’s occurrence count to 1;

    2. get the next string

    3. if it’s the same as the current majority candidate increment it’s occurrence count

    4. otherwise decrement the occurrence count

    5. if the occurrence count reaches 0 replace the majority candidate with the current string

    6. repeat from 2 as long as you have strings to read

    7. if at the end the occurrence count is greater than 0 rescan the log and count the actual number of occurrences of the candidate to check if it really is the majority string.

      So you’ll have to go through the log twice.

    Note: This is from a problem used in an ACM programming contest a while ago, available here.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I want to find a way to copy one file to multiple locations simultaneously
In SQL Server 2005 If I want to find the right-most one character of
I want to find most optimal way to iterate values in key in python.
I want to find the most recent commit that modified a source file. I
I want to find the most used colour in an image using python. for
How to find out what class is referenced the most? I want to find
I want find all Saturdays and Sundays in A given month. How can I
I want to find the Xelement attribute.value which children have a concrete attribute.value. string
I want to find files containing the word navbar anywhere in files. I can
One thing that I find most annoying about OOP is that whenever you need

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.