How can I design an algorithm which can return the 10 most frequently used

Question

0

Asked: June 13, 20262026-06-13T11:21:02+00:00 2026-06-13T11:21:02+00:00

How can I design an algorithm which can return the 10 most frequently used

0

How can I design an algorithm which can return the 10 most frequently used words in a document in O(n) time? If additional space can be used.

I can parse and place the words in a hash map with count . But next I have to sort the values to get the most frequent ones . Also I have to have a mapping btw the values -> Key which cannot be maintained since values may be repeating.

So how can I solve this ?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-13T11:21:03+00:00

It may be done in O(n) if you use the correct data structure.

Consider a Node, consisting of 2 things:

A counter (initially set to 0).
An array of 255 (or whatever number of characters) pointers to Node. All the pointers are initially set to NULL.

Create a root node. Define a “current” Node pointer, set it to root node initially.
Then walk through all the characters of the document and do the following:

If the next characters is not a space – pick the appropriate pointer from the array of the current node. If it’s NULL – allocate it. The current Node pointer is updated.
If it’s a space (or whatever word delimiter) – increment the counter of the “current” Node. Then reset the “current” Node pointer to point to the root node.

By such you build a tree in O(n). Every element (both node and leave) denote a specific word, together with its counter.

Then transverse the tree to find the node with the largest counter. It’s also O(n), since the number of elements in the tree is not bigger than O(n).

Update:

The last step is not mandatory. Actually the most common word may be updated during the character processing.
The following is a pseudo-code:

struct Node
{
    size_t m_Counter;
    Node* m_ppNext[255];
    Node* m_pPrev;

    Node(Node* pPrev) :m_Counter(0)
    {
        m_pPrev = pPrev;
        memset(m_ppNext, 0, sizeof(m_ppNext));
    }
    ~Node()
    {
        for (int i = 0; i < _countof(m_ppNext) i++)
            if (m_ppNext[i])
                delete m_ppNext[i];
    }

};

Node root(NULL);
Node* pPos = &root;
Node* pBest = &root;
char c;

while (0 != (c = GetNextDocumentCharacter()))
{
    if (c == ' ')
    {
        if (pPos != &root)
        {
            pPos->m_Counter++;

            if (pBest->m_Counter < pPos->m_Counter)
                pBest = pPos;

            pPos = &root;
        }
    } else
    {
        Node*& pNext = pPos->m_ppNext[c - 1];
        if (!pNext)
            pNext = new Node(pPos);
        pPos = pNext;
    }
}

// pBest points to the most common word. Using pBest->m_pPrev we iterate in reverse order through its characters

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

How can I design an algorithm which can return the 10 most frequently used

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply