WebScraping: Pandas to_excel Not Displaying full DataFrame

Question

I am brand new to coding, and was given a web scraping tutorial (found here) to help build my skills as I learn. I've already had to make several adjustments to the code in this tutorial, but I digress. I'm scraping off of http://books.toscrape.com/ and, when I try to export a Dataframe of just the book categories into Excel, I

Accepted Answer

The code in question will not create any dataframe. However, you should select your elements more specific for example with css selectors:for a in soup.select('ul.nav-list a'):    if a.get_text(strip=True) not in results:        results.append(a.get_text(strip=True))Exampleimport requestsfrom bs4 import BeautifulSoupimport pandas as pdresults = []soup = BeautifulSoup(requests.get('http://books.toscrape.com/').content)for a in soup.select('ul.nav-list a'):    if a.get_text(strip=True) not in results:        results.append(a.get_text(strip=True))pd.DataFrame({'Categories': results})OutputCategories0Books1Travel2Mystery3Historical Fiction4Sequential Art5Classics6Philosophy&#8230;

	Categories
0	Books
1	Travel
2	Mystery
3	Historical Fiction
4	Sequential Art
5	Classics
6	Philosophy

Advertisement

Answer

Example

Output