The Background: I have been creating a script that based on input csv’s that

Question


Asked: May 20, 2026 (2026-05-20T08:19:18+00:00)


The Background:

I have been writing a script that reads the CSV files in an input directory and builds a 3-dimensional array to store the aggregated information. Each table in the array represents one pollution source (e.g. for the input file Incinerators.csv, the table holds the aggregated information about the various pollutants released by incinerators at the watershed scale). Each row holds the aggregated information for one watershed (row 0 contains the headers), and each column holds the amount and toxic equivalent of one substance (column 0 contains the watershed ID).

For each substance in each watershed, the total released by all sources is calculated and stored in another array with exactly the same layout. It is addressable as totals[wsid][substance], either by index or by name through dictionary lookups.
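To make the layout concrete, here is a minimal sketch of such a totals table with name-based lookups. The sample values and the names `sub_index`, `wsid_index` and `lookup` are made up for illustration and are not from the original script.

```python
# Hypothetical sample data: row 0 holds the headers, column 0 the watershed IDs.
totals = [
    ["wsid", "Lead", "Mercury"],
    ["WS01", 12.5, 0.0],
    ["WS02", 3.0, 1.2],
]

# Map each substance name to its column index (skipping the wsid column)
# and each watershed ID to its row index (skipping the header row).
sub_index = dict((name, col) for col, name in enumerate(totals[0]) if col > 0)
wsid_index = dict((row[0], r) for r, row in enumerate(totals) if r > 0)


def lookup(table, wsid, substance):
    """Return the cell for a watershed/substance pair given by name."""
    return table[wsid_index[wsid]][sub_index[substance]]
```

With this in place, `lookup(totals, "WS02", "Mercury")` and `totals[2][2]` address the same cell, once by name and once by index.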

The Question:

With this table of totals, I need to calculate each watershed’s relative rank for the amount of each substance released compared to what is released in other watersheds.

I could use a couple of nested loops to go through each substance column, convert it into a list, sort the list, and then relate the result back to the watershed ID, but this would not be a very clean solution. Zero values also need to be omitted from the ranking, and duplicate values should be given the same rank while decreasing the total number being ranked.

Is there a smarter way to do this? Or a module where this is already implemented? (I didn’t see anything obvious in PyTables.)

One of the requirements is that the solution remain simple enough that people with even less Python experience than I have will at least be able to understand the process. I can use Python versions up to 2.7.1.
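One way to read the requirements above is as a "dense" ranking: zeros and non-numeric cells are skipped, equal values share a rank, and the number of ranks equals the number of distinct values. The sketch below works under that assumption. The function name `rank_column` and the sample table are hypothetical, and it uses only built-ins available in Python 2.7.

```python
def rank_column(table, col):
    """Return {wsid: rank} for one column, with the highest value ranked 1."""
    # Collect values by watershed ID, skipping the header row,
    # non-numeric cells (no measurement) and zeros.
    values = {}
    for row in table[1:]:
        value = row[col]
        if isinstance(value, (int, float)) and value != 0:
            values[row[0]] = value

    # Each distinct value gets one rank, so ties share a rank
    # and the highest rank number equals the count of distinct values.
    distinct = sorted(set(values.values()), reverse=True)
    rank_of = dict((v, i + 1) for i, v in enumerate(distinct))
    return dict((wsid, rank_of[v]) for wsid, v in values.items())


# Hypothetical sample: A and C tie, B is zero, E has no measurement.
sample = [
    ["wsid", "Lead"],
    ["A", 5.0],
    ["B", 0.0],
    ["C", 5.0],
    ["D", 2.0],
    ["E", "n/a"],
]
ranks = rank_column(sample, 1)  # {"A": 1, "C": 1, "D": 2}
```

If tied watersheds should instead consume rank positions (1, 1, 3), the `distinct` list would need to be replaced with a count of strictly larger values, which is a different convention from the one assumed here.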

The End Goal:

Generate HTML pages with the results, to be iframed from a Google Earth description bubble. I have put a couple of entirely unfinished sample outputs here.


1 Answer

Editorial Team · Answer 1 · 2026-05-20T08:19:18+00:00

For this I have created two functions:

import os  ## used by buildRankTable to build the output path
from operator import itemgetter

def sortTable(table, col):
    return sorted(table, key=itemgetter(col))

And

def buildRankTable(totalTable, fieldList, wsidList, subList, subDict, wsidDict):
    ## build rankTable to mimic other templates
    rankTable = newTemplateTable(wsidList, fieldList)

    ## add another row to track total number ranked for each substance
    numRanked = [0 for i in range(len(fieldList))]
    numRanked[0] = "TotalNoRanked"
    rankTable.append(numRanked)

    for substance in subList:
        tempTable = sortTable(totalTable, subDict[substance])
        exportCsv(tempTable, outdir + os.sep + "rankT_" + substance + ".csv")
        rankList = []
        ## extract the low-to-high list of wsids, skipping non-floats (no measurement)
        for row in tempTable:
            if isinstance(row[subDict[substance]], float):
                rankList.append(row[0]) ## build wsid list in ranked order
        numRanked[subDict[substance]] = len(rankList)

        ## by default, this ranks low to high, we want to rank high to low starting at 1
        rankList.reverse()

        ## with the list of ranked wsids, get the rank and save to rankTable
        for rank, wsid in enumerate(rankList): 
            rankTable[wsidDict[wsid]][subDict[substance]] = rank + 1

    ## any 0 (default) values become 'NR' - No Rank
    for rowI in range(len(rankTable)):
        for colI in range(len(rankTable[rowI])):
            if rankTable[rowI][colI] == 0:
                rankTable[rowI][colI] = "NR"
    return rankTable

fieldList = list of fields in the first row
wsidList = list of wsids (the remaining 595 rows)
subList = list of substances to be ranked
subDict = dictionary mapping each substance to its column index in totalTable
wsidDict = dictionary mapping each wsid to its row index in totalTable
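The answer calls `newTemplateTable` without showing it. Judging only from how the result is used (a header row, one row per wsid, and 0 as the default that later becomes 'NR'), a stand-in might look like the sketch below. This is a guess, not the author's actual helper.

```python
def newTemplateTable(wsidList, fieldList):
    """Guessed stand-in: a header row plus one zero-filled row per watershed."""
    table = [list(fieldList)]  # row 0 = headers (copied so the input is not shared)
    for wsid in wsidList:
        # column 0 = watershed ID, remaining columns default to 0 (unranked)
        table.append([wsid] + [0] * (len(fieldList) - 1))
    return table
```

Each row is built as a fresh list, so assigning a rank to one watershed does not affect the others.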
