Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 881055
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 15, 20262026-05-15T12:11:58+00:00 2026-05-15T12:11:58+00:00

I want to find most often seen string in a huge log file. Can

  • 0

I want to find most often seen string in a huge log file. Can someone help me how to do this. one way to do this is to hash each and every string and count the maximum value but its not efficient. Are there any better ways to do this.

Thanks & Regards,

Mousey.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-15T12:11:59+00:00Added an answer on May 15, 2026 at 12:11 pm

    If performance is critical you may want to look at a trie or a Radix tree.


    If you’re just interested to know if one of the strings appears more than 50% of the times (let’s call that string the majority string) you can do something like this (see if I can get this right):

    1. get the first string and assume it’s the majority string and set it’s occurrence count to 1;

    2. get the next string

    3. if it’s the same as the current majority candidate increment it’s occurrence count

    4. otherwise decrement the occurrence count

    5. if the occurrence count reaches 0 replace the majority candidate with the current string

    6. repeat from 2 as long as you have strings to read

    7. if at the end the occurrence count is greater than 0 rescan the log and count the actual number of occurrences of the candidate to check if it really is the majority string.

      So you’ll have to go through the log twice.

    Note: This is from a problem used in an ACM programming contest a while ago, available here.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I want to find a sql command or something that can do this where
I want to find the most recent commit that modified a source file. I
I want to find a way to develop database projects quickly in Visual Studio.
I want to find any text in a file that matches a regexp of
i want to find the mime-type for a given file extension on an IIS
I want to find a linux command that can return a part of the
Maybe is a often repeated question here, but i can't find anything similar with
I'm baffled that I can't find a quick answer to this. I'm essentially looking
I'm working with mysql and I want to find the most common email suffixes
What is the quickest and most efficient way of finding a string within another

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.