Let me explain with an example. We have the following text: Comme Il Faut

Question

0

Asked: May 21, 20262026-05-21T11:46:52+00:00 2026-05-21T11:46:52+00:00

Let me explain with an example. We have the following text: Comme Il Faut

0

Let me explain with an example.
We have the following text:

“Comme Il Faut was founded in 1927. The tobacco company is most well known for its reputation of producing customized private label brands for its partners worldwide”.

This is normal text. But the following text:

“CommeIlFautwasfounded in 1927. The tobacco companyi most wellknown foritsreputation of producing customizedprivatelabelbrands foritspartners worldwide”

This is text anomaly: typos, words without a space, maybe something else.

How to search for such anomalies?
What algorithms are there for this (statistical)?

It is desirable that the result was a percentage: for example, 80% of the anomalies.

Thanks.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-21T11:46:53+00:00

Editorial Team

2026-05-21T11:46:53+00:00Added an answer on May 21, 2026 at 11:46 am

Construct a Trie tree with all the known words in the dictionary.
Take each word that apears in your text and try to find it in the Trie tree. If you don’t find it then try to match prefix of length-k. If you find a match then you apply the same procedure to the rest k characters. It’s recursive and it could catch more than two concatenated words

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Let me explain with an example. We have the following text: Comme Il Faut

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply