I've asked a similar question (Remove duplicate value from dictionary without removing key) but I think I'm getting deeper into the problem and can explain it better because, unfortunately, none of the answers did what I needed but they did answer my question.
I'm building an OrderedDict from two .csv files. The first one contains location codes, while the second one is a list of hardware relocations by time. All location codes are unique so that is the key to my dictionary, I've got a loop that builds the dictionary with empty values using that .csv file. Then I have another loop that adds the hardware data to the correct location code. Some of the hardware data is in a list form so it isn't hashable.
The problem I'm having is that as the hardware moves to a new location, I need it to be removed from its previous location. so its only in one place at the end of the of the code
My location codes are;
>1, 2, 3, 4, 5, 6
My hardware data is in order by time,
>7pm, 1, 'item1', 'item2', 'item3'
>8pm, 2, 'item4', 'item5', 'item6'
>9pm, 3, 'item7', '', ''
>10pm, 4, 'item8', '', ''
>11pm, 5, 'item1', 'item2', 'item3'
>12am, 6, 'item7', '', ''
>1am, 3, 'item4', 'item5', 'item6'
If I run the code for the entire timeframe without any conditional statements my final dictionary looks like
>myDict = {'1': ['item1', 'item2', 'item3'], '2': ['item4', 'item5', 'item6'],
>'3': 'item7', '4': 'item8', '5': ['item1', 'item2', 'item3'], '6': 'item7'}
But what I need it too look like is;
>my Dict = {'1': '', '2':'', '3': ['item4', 'item5', 'item6'], '4':
>'item8', '5': ['item1', 'item2', 'item3'], '6': 'item7'}
Because the items (values) are not added to the dictionary in the same order that the locations (keys) are added, its important that I do this while building the dictionary (adding the values) because I can't go back through and just remove the duplicates after its completed.
I've tried many things and have gotten different results but my latest is
locationCSV = open('location.csv', "r")
hardwareCSV = open('hardware.csv', "r")
locationLines = locationCSV.readlines()
hardwareLines = hardwareCSV.readlines()
finalLoc = OrderedDict()
for line in locationLines:
locationList = line.split(",")
code = locationList[0]
finalLoc[code] = ""
for line in hardwareLines:
hardwareList = line.split(",")
hardwareData = [hardwareList[2],hardwareList[3],hardwareList[4]]
for k, v in finalLoc.iteritems():
if hardwareData in finalLoc.itervalues():
finalLoc[k] = ""
finalLoc[hardwareList[1]] = hardwareData
print finalLoc
This returns all the locations empty. I've been stuck on this for a few days so any help would be appreciated.
There are a number of problems with your code that prevent you from even getting that far, so this can't possibly be your real code. But let's go through the errors:
This is going to give you values like
" 1"
and" 'item1'"
, which I can't imagine is what you actually want.In fact, the fact that your lines have stray whitespace at the end means they won't even match up. For example, the last string in the second line is
" 'item6' "
, but in the last line it's" 'item6'"
, which aren't the same string.This would be much easier if you used the
csv
library instead of trying to do it yourself. If you just want a quick hack to solve the problem, you canstrip
each entry:Since some of your lines only have 3 columns, this is going to raise an
IndexError
. If you just want to just get fewer than 3 values for short rows instead of raising, you can do:For each line, you're going through the entire dictionary, and for each entry, searching the entire dictionary to see if
finalLoc
is a value anywhere. So, if there are already 10 items in the dict, you're going to find each line that already exists 100 times. Which means that if you blank what you find, for each line, you're going to blank every line 10 times.You probably wanted
if hardwareData == v
here.You haven't defined
key
anywhere in the code you've shown us. If you defined it somewhere earlier, it's going to blank out the same value each of the 100 times for each line. Otherwise, this will just raise aNameError
.You probably wanted
finalLoc[k]
here.This whole part would be a lot simpler (and more efficient) if you kept an inverse dictionary, mapping each value to its key.
Anyway, putting together all those fixes, your code works:
The output is:
Here's a version using the
csv
module and an inverse mapping:I still needed to explicitly strip the excess trailing whitespace on each line, but otherwise,
csv
took care of everything for me—and notice that it also removes the excess quotes around each value.Meanwhile, instead of having to figure out how to walk through the items and find every key that matches the current value, the
invmap
lets me just look it up in one step. (Notice that the mapping have to be 1-to-1, because if you'd already encountered the value twice, the first one would already have been removed.)Of course even fixing the stripping and quoting problems, the results still aren't exactly what you want. Your desired output apparently unwraps single-element lists into just the element. If you want that, you'll need to do that explicitly. But you probably don't want that. In fact, you probably want to use
[]
instead of''
as your "empty" value. That way, you know the value is always a list of 0 or more items, instead of having to treat an empty string as 0 values, any other string as 1 value, and a list as multiple values. So, when you process it, instead of writing code like this:… you can just do this: