Python – trying to get beautifulsoup to find words in a list, but it’s unable to find them

Question

I'm working on my first project that isn't straight out of a book but I'm having trouble getting a function to work. The function receives a list of strings and a BeautifulSoup object and attempts to find each word in the soup.text. However, the code seems unable to find any words/strings at all even when I am certain it should

Accepted Answer

did you use try except block? the problem maybe with file encoding because I got an error with soup.txtUnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 91868:And words_count will always 0 or 1, you need to use .count() or Regex to count how many times the substring is present in itimport redef find_words(words_list, urlSoup):    url = 'soup.txt'    for word in words_list:        words_count = len(re.findall(word, urlSoup, re.IGNORECASE)) # remove re.IGNORECASE if you need exact casing        # or        # words_count = urlSoup.count(word) # exact casing        if words_count > 0:            print("The word " + word + " was found " + str(words_count) + " times in " + url + ".")        else:            print("The word '" + word + "' was not found in the URL you provided.") # add encoding="utf-8" to fix file read           with open('soup.txt', 'r', encoding="utf-8") as f:    words_list = ['running', 'outdoors', 'outdoor', 'shoes', 'clothing', 'delivery']    find_words(words_list, f.read())

Advertisement

Answer