I have to write a simple piece of code that acts on a file; my software have to scans the file searching for a pattern with a regex (I never had to use the regex in C, but generally speaking, I know them) and it always have to pretends to decode and encode in UTF8.
The file has also a UNIX file format and it could be really nice to have multiple regex running at the same time like with an OR statement or similar.
I use the
pcrelibrary for using regex in C/C++. Originally because we were porting perl code to C, and its a good simple library with UTF-8 support. http://www.pcre.org/And when we have to deal with non UTF-8 encodings we use the
iconvlibrary to convert to UTF-8 first. http://www.gnu.org/s/libiconv/