I have a simple 4-column Excel spreadsheet that matches universities to their ID codes for lookup purposes. The file is pretty big (300k).
I need to come up with a way to turn this data into a populated table in my Rails app. The catch is that this is a document that is updated now and then, so it can’t just be a one-time solution. Ideally, it would be some sort of ruby script that would read the file and create the entries automatically so that when we get emailed a new version, we can just update it automatically. I’m on Heroku if that matters at all.
How can I accomplish something like this?
If you can, save the spreadsheet as CSV, there’s much better gems for parsing CSV files than for parsing excel spreadsheets. I found an effective way of handling this kind of problem is to make a rake task that reads the CSV file and creates all the records as appropriate.
So for example, here’s how to read all the lines from a file using the old, but still effective FasterCSV gem
It does kinda rely on you having a unique column in the original spreadsheet to go off though.
If you put that into a rake task, you can then wire it into you’re Capistrano deploy script, so it’ll be run every time you deploy. the find_or_initialize should ensure you shouldn’t get duplicate records.