response.text is printing only special symbols for a plain-text response

Question

A GET request downloads following output (checked the response with Chrome Dev Tools): HTML output Output via response.content When I am printing response.content to the console or to a file I am getting something like this: Output via response.text With response.text I got this (as depicted in image): Original Code All variables are already defined: How can the plain-text response

Accepted Answer

For evaluating a response from an arbitrary GET request, you should always evaluate the response.headers.The header with key Content-Type tells you something about the MIME type like text/html or application/json of a response and its encoding like UTF-8.In your case the result of response.headers['Content-Type'] probably would return "text/html; charset=UTF-8".So you know, that you need to decode the response from UTF-8 as Parvat. R commented by r.content.decode('utf-8').Here we caneither use response.encoding to dynamically decode the response.text based on response&#8217;s given encodingor we can simply use response.content to get the bytes as binary representation (e.g. b'x833x01')Since you claim the response was text/HTML (as seen in browser), you could simply decode the textual representation and append it to the text-file:s = requests.Session()r = s.get(url,headers = headers)print(r.text)            if (r.status_code == 200):    print("Generated Successfully")    # detect encoding and decode respectively    print("Response encoding", r.encoding)    body_text = r.text.decode(r.encoding)    with open("Alt.txt", 'a') as f:        f.write(str(body_text) + 'n')  # print body as string to fileelse:    print("BAD Request " + str(r.status_code))    s.cookies.clear()See also:python requests.get() returns improperly decoded text instead of UTF-8?

response.text is printing only special symbols for a plain-text response

HTML output

Output via `response.content`

Output via `response.text`

Original Code

Advertisement

Answer