First, this question may not seem programming-related, but I decided to post it here anyway since the line is not clearly defined. If you think it does not belong here, feel free to delete it.
I am curious about how Google and other search engines determine keywords. Do they just find keywords the way we do, by using CTRL+F and counting, or do they use some more obscure semantic technology to single out keywords?
The reason I ask is that when I check cached pages in search engines, they usually highlight keywords in different colors. Like this one:
But it seems they do not count the keyword car when it appears inside a word like carpad, whereas, as you know, CTRL+F counts the car in carpad.
So my question is: if I put a company name like carpad or cardoctor on my site, does it help with the keyword car?
Disclaimer: Car, carpad and the URL I provided are just examples. And I hope I made myself clear.
Thanks in advance!
One of the basic techniques they use is the anchor text, that is, the keywords placed in the links that point to a site. For example, when you link to an article about Obama’s party crashers with the link text “Obama party crashers”, Google can infer that the destination site is about that topic.
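To make the link-text idea concrete, here is a minimal sketch in Python. It only illustrates the general idea and is not how any real search engine is implemented. The `AnchorTextCollector` class, the `anchor_terms` function, and the example.com URL are all made up for this example.

```python
from collections import Counter, defaultdict
from html.parser import HTMLParser


class AnchorTextCollector(HTMLParser):
    """Collects (href, link text) pairs from one HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []     # finished (href, text) pairs
        self._href = None   # href of the <a> tag currently open, if any
        self._text = []     # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def anchor_terms(pages):
    """Map each link target to a count of the words used in links pointing at it."""
    terms = defaultdict(Counter)
    for html in pages:
        collector = AnchorTextCollector()
        collector.feed(html)
        for href, text in collector.links:
            terms[href].update(text.lower().split())
    return terms


# Hypothetical pages that both link to the same made-up URL.
pages = [
    '<p>Read about the <a href="http://example.com/story">Obama party crashers</a>.</p>',
    '<p>More on the <a href="http://example.com/story">party crashers</a> here.</p>',
]
result = anchor_terms(pages)
print(result["http://example.com/story"].most_common())
```

The words that show up most often in links to a page become candidate keywords for that page, even if the page itself never uses them.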
Next, it uses recursive inference. If it knows that sites A and B are about topic X, and they both link to site C, it can assume that site C is also about topic X.
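A rough sketch of that kind of inference, assuming a tiny hand-written link graph. The `propagate_topics` function and the `min_votes` threshold are inventions for this example, and real systems are certainly far more involved.

```python
from collections import Counter


def propagate_topics(links, known, min_votes=2):
    """Spread topic labels along links until nothing changes.

    links: dict mapping a site to the list of sites it links to.
    known: dict mapping a site to its already-known topic.
    min_votes: how many labelled sites must agree before a target is
               labelled (an arbitrary threshold chosen for this sketch).
    """
    topics = dict(known)
    changed = True
    while changed:
        changed = False
        votes = {}
        for source, targets in links.items():
            topic = topics.get(source)
            if topic is None:
                continue  # unlabelled sites cannot vote yet
            for target in set(targets):  # one vote per source and target
                if target not in topics:
                    votes.setdefault(target, Counter())[topic] += 1
        for target, counter in votes.items():
            topic, count = counter.most_common(1)[0]
            if count >= min_votes:
                topics[target] = topic
                changed = True
    return topics


# A and B are known to be about X. Both link to C, and B and C link to D.
links = {"A": ["C"], "B": ["C", "D"], "C": ["D"]}
known = {"A": "X", "B": "X"}
topics = propagate_topics(links, known)
print(topics)
```

In the first pass C gets topic X because A and B both link to it. Only after that does D get labelled, since C's link now counts as a second vote next to B's. That repeated pass is the recursive part.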
Next, there is actual text mining of the site’s content. Techniques such as TF-IDF (term frequency times inverse document frequency) are used to determine the most relevant keywords from a given page’s content.
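Here is a small sketch of plain TF-IDF in Python, assuming the simplest textbook weighting (relative term frequency times the log of total documents over documents containing the term). The three sample documents and the function names are made up, and search engines use much more refined variants.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase whole words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tf_idf(documents):
    """Return one {term: score} dict per document."""
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term appear?
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Made-up sample documents.
docs = [
    "carpad sells a car seat cover and a car floor mat",
    "the doctor sells a stethoscope",
    "a recipe for a cake",
]
scores = tf_idf(docs)
top = sorted(scores[0], key=scores[0].get, reverse=True)[:3]
print(top)
```

Note that the tokenizer works on whole words, so carpad is a different term from car and adds nothing to the score for car. That matches what the question observed in the cached-page highlighting. A word such as "a" that occurs in every document gets a score of zero, which is how TF-IDF filters out words that say nothing about a particular page.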