This Python code gives unwanted output when query_words if of size greater than 1

Question

I've written some code, but it does not output what I expected. Here is the code: The expected final value of big_ds is: {123: {'dollar': ['currency'], 'probabilistic': []}, 108: {'dollar': [], 'probabilistic': ['probabilistic']}} But the code sets the value of big_ds to the following: {123: {'dollar': ['currency'], 'probabilistic': ['currency']}, 108: {'dollar': ['probabilistic'], 'probabilistic': ['probabilistic']}} I asked a similar question a

Accepted Answer

It&#8217;s because:dict.fromkeys(query_words, [])&#8230;the keys in each mail_id sub-dict each share the same list instance.See:&#8220;Least Astonishment&#8221; and the Mutable Default ArgumentDictionary creation with fromkeys and mutable objects. A surpriseTry this instead:query_words = ['dollar', 'probabilistic']query_word_to_synonym_dict = {'probabilistic': ['probabilistic'], 'dollar' : ['currency']}mail_ids = {123, 108}big_ds = {}index = {'probabilistic':{(108, 1)}, 'currency':{(123, 1)}}for mail_id in mail_ids:    big_ds[mail_id] = {word: [] for word in query_words}for query_word in query_words:    syns = query_word_to_synonym_dict[query_word]    for syn in syns:        index_of_word = index[syn]        tuple_first = []        for tuples in index_of_word:            tuple_first.append(tuples[0])        for number in tuple_first:            big_ds[number][query_word].append(syn)print(big_ds)

Advertisement

Answer