Scenario: Attempting to import many (>100) large (>1M records) flat files (CSV).
Problem: Many records are missing field delimiters.
Asking: Is there a parsing tool that will attempt to define and validate the file and allow you to make ‘in-line’ corrections?
ETA: I’m attempting to import these files to MS SQL Server using the DTS Import Wizard. The error message gives me only the line number of the file that it breaks on. Fix; repeat.
Been there, done that. Wrote my own tool.
It’s remarkable how many programs that allegedly output CSV don’t actually do so correctly.
A commercial tool would be nice, but given the variety of problems that I’ve encountered in CSV files (missing delimiters, bad separator values, embedded CR/LF in the middle of fields, etc.), it was worth writing my own. That way when I find a new problem I just extend my existing program to deal with it.
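For anyone who wants a starting point, here is a rough sketch of the kind of pre-import check I mean. It is not my actual tool, just an illustration using Python's standard csv module. The function names (find_bad_rows, join_broken_records) and the expected_fields parameter are made up for this example. The first function lists every record with the wrong number of fields in one pass, so you don't have to fix one line and rerun the import. The second is a naive heuristic for embedded CR/LF: it assumes a short line is a record that was split in two and glues it to the following line(s).

```python
import csv


def find_bad_rows(lines, expected_fields, delimiter=","):
    """Return (line_number, field_count) for each record with the wrong field count."""
    bad = []
    reader = csv.reader(lines, delimiter=delimiter)
    for row in reader:
        if len(row) != expected_fields:
            # line_num is the last physical line consumed for this record
            bad.append((reader.line_num, len(row)))
    return bad


def join_broken_records(lines, expected_fields, delimiter=","):
    """Heuristic repair: merge short lines with the lines that follow them.

    Assumption: a line with too few fields was split by a stray CR/LF.
    A record that is genuinely missing a delimiter will be merged wrongly,
    so rerun find_bad_rows on the result and review whatever it flags.
    """
    fixed = []
    pending = ""
    for line in lines:
        line = line.rstrip("\r\n")
        candidate = pending + " " + line if pending else line
        count = len(next(csv.reader([candidate], delimiter=delimiter)))
        if count < expected_fields:
            pending = candidate      # still short, keep accumulating
        else:
            fixed.append(candidate)  # complete (or over-long) record
            pending = ""
    if pending:
        fixed.append(pending)        # trailing fragment, left for manual review
    return fixed


sample = ["id,name,note", "1,Ann,ok", "2,Bob", "Smith,late", "3,Cy,fine"]
problems = find_bad_rows(sample, expected_fields=3)
repaired = join_broken_records(sample, expected_fields=3)
```

On a real file you would pass an open file object (opened with newline="") instead of the sample list, and write the repaired lines to a new file rather than editing the original in place.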
I should probably change my nickname to NIH given my propensities.