I have a transaction log file in CSV format that I want to run statistics on. The log has the following fields:
date: Time/date stamp
salesperson: The username of the person who closed the sale
promo: Sum total of items in the sale that were promotions
amount: Grand total of the sale
I’d like to get the following statistics:
salesperson: The username of the salesperson being analyzed
minAmount: The smallest grand total among this salesperson's transactions
avgAmount: The mean grand total
maxAmount: The largest grand total
minPromo: The smallest promo amount by the salesperson
avgPromo: The mean promo amount...
I’m tempted to build a database structure, import this file, write SQL, and pull out the stats. I don’t need anything more from this data than these stats. Is there an easier way? I’m hoping some bash script could make this easy.
Could also bang out an awk script to do it. It’s only CSV with a few variables.
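Something along these lines might do it. This is only a rough sketch, and it rests on a few assumptions that are not stated in the question: the columns appear in the order date, salesperson, promo, amount; the first row is a header; and no field contains a quoted comma. The function name `sales_stats` is made up for illustration. It covers only the stats spelled out in the question; anything further would follow the same pattern.

```shell
# Sketch: per-salesperson min/avg/max of amount and min/avg of promo.
# Assumed (not confirmed by the question): column order is
# date,salesperson,promo,amount; row 1 is a header; no quoted commas.
sales_stats() {
  awk -F',' '
    NR == 1 { next }                 # skip the assumed header row
    {
      s = $2; p = $3 + 0; a = $4 + 0 # force numeric interpretation
      if (!(s in n)) {               # first row seen for this salesperson
        minA[s] = a; maxA[s] = a; minP[s] = p
      }
      if (a < minA[s]) minA[s] = a
      if (a > maxA[s]) maxA[s] = a
      if (p < minP[s]) minP[s] = p
      sumA[s] += a; sumP[s] += p; n[s]++
    }
    END {
      print "salesperson,minAmount,avgAmount,maxAmount,minPromo,avgPromo"
      for (s in n)                   # output order is unspecified in awk
        printf "%s,%.2f,%.2f,%.2f,%.2f,%.2f\n",
          s, minA[s], sumA[s] / n[s], maxA[s], minP[s], sumP[s] / n[s]
    }
  ' "$1"
}
```

Run it as `sales_stats transactions.csv` (the file name is a placeholder). Because awk does not guarantee the order of `for (s in n)`, pipe the output through `sort` if you need the rows in a stable order. If the file has quoted fields with embedded commas, a plain `-F','` split will break and a real CSV parser is the safer route.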