To make matter more specific:
- How to detect people names (seems like simple case of named entity extraction?)
- How to detect addresses: my best guess – find postcode (regexes); country and town names and take some text around them.
- As for phones, emails – they could be probably caught by various regexes + preprocessing
- Don’t care about education/working experience at this point
Reasoning:
In order to build a fulltext index on resumes all vulnerable information should be stripped out from them.
P.S. any 3rd party APIs/services won’t do as a solution.
The problem you’re interested in is information extraction from semi structured sources. http://en.wikipedia.org/wiki/Information_extraction
I think you should download a couple of research papers in this area to get a sense of what can be done and what can’t.