Combining concurrent.futures.as_completed() with a dictionary using zip()

Question

I am a first-time user of concurrent.futures and am following the official guides. Problem: inside the as_completed() block, how do I access the k, v that were used to build future_to_url? The k variable is vital. Using something like: I stumbled on this post, however I cannot decipher the syntax to reproduce Origina…

Accepted Answer

This has nothing to do with futures and more to do with dict comprehensions.

    future_to_url = {executor.submit(visit_url, v): v for k, v in urls.items()}

This loops over everything in the urls dict, unpacks each key and value (k, v), and submits the value to the executor to run visit_url. k and v are not available outside the comprehension because their scope belongs to the comprehension itself.

If you want both the result of the call and the key it was called for, you can pass the key into the function and return it as part of a tuple:

    import concurrent.futures

    import requests

    def visit_url(url_id, url):
        # Return the key alongside the response so the caller can pair them up.
        return url_id, requests.get(url)

    def start():
        with concurrent.futures.ThreadPoolExecutor(max_workers=50) as executor:
            future_to_url = {executor.submit(visit_url, k, v): v for k, v in urls.items()}
            for future in concurrent.futures.as_completed(future_to_url):
                url_id, response = future.result()
                print(f"id: {url_id}")
                # Use response.json() instead if the endpoint returns JSON.
                print(f"status: {response.status_code}")

    # requests needs the scheme (http://) in each URL.
    urls = {
        'id123': 'http://www.google.com',
        'id456': 'http://www.bing.com',
        'id789': 'http://www.yahoo.com',
    }

    start()

After comments made by OP (mainly that it seems dirty to use the return value of visit_url to pass context/keys back after execution), I can propose a more OOP way of doing this:

    import concurrent.futures

    import requests

    class URL:
        def __init__(self, url_id, url):
            self.id = url_id
            self.url = url
            self.response = None

        def visit(self):
            # Store the response on the object and return the object itself.
            self.response = requests.get(self.url)
            return self

    def start():
        with concurrent.futures.ThreadPoolExecutor(max_workers=50) as executor:
            future_to_url = {executor.submit(c.visit): c for c in urls}
            for future in concurrent.futures.as_completed(future_to_url):
                data = future.result()
                print(f"response: {data.response}")
                print(f"id: {data.id}")

    urls = [
        URL('id123', 'http://www.google.com'),
        URL('id456', 'http://www.bing.com'),
        URL('id789', 'http://www.yahoo.com'),
    ]

    start()

This ensures the response, ID and URL are kept together in their class, which might be cleaner for some. The comprehension that submits to the executor is simplified as well.
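
A third option, sketched here as an alternative to the two approaches above: the dictionary built by the comprehension already maps each future to whatever was stored as its value, so storing the (k, v) pair there lets you recover both inside the as_completed() loop without changing what the worker returns. In this sketch fake_visit and collect are hypothetical names, and fake_visit is only a stand-in for visit_url so that the example runs without network access.

```python
import concurrent.futures

urls = {
    "id123": "http://www.google.com",
    "id456": "http://www.bing.com",
    "id789": "http://www.yahoo.com",
}


def fake_visit(url):
    # Hypothetical stand-in for visit_url(); a real version might
    # return requests.get(url) instead of the URL's length.
    return len(url)


def collect(urls):
    results = {}
    with concurrent.futures.ThreadPoolExecutor(max_workers=3) as executor:
        # Store the (key, url) pair as the dict value for each future.
        future_to_item = {
            executor.submit(fake_visit, v): (k, v) for k, v in urls.items()
        }
        for future in concurrent.futures.as_completed(future_to_item):
            # Look the pair up by the future that just completed.
            k, v = future_to_item[future]
            results[k] = (v, future.result())
    return results


results = collect(urls)
for k, (v, outcome) in sorted(results.items()):
    print(f"id: {k}, url: {v}, result: {outcome}")
```

The worker function stays unaware of the key, which may suit cases where visit_url is shared with other callers. The trade-off is that the pairing lives in the dictionary, so the dictionary has to stay in scope for the duration of the loop.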
