Web Scraping for Python using Beautiful Soup
Learn web scraping with Python using Beautiful Soup. Understand how to extract and process data from websites, automate the collection of structured information, and store it for analysis using Python’s powerful scraping and parsing tools.
At a Glance
Data is the fuel of Data Science. We can get data from databases and other data repositories. A lot of data is published as web pages. Web scraping is the process of harvesting data from web pages. BeautifulSoup is a Python library that allows for web scraping, parsing, and extracting data from HTML and XML documents. In this guided project, you will use BeautifulSoup to scrape the contents of a web page.
A Look at the Project Ahead
- Create a BeautifulSoup object
- Extract information from HTML files
- Download and scrape the contents of a web page
What You’ll Need
- Familiarity with Python fundamentals
- Familiarity with the basics of HTML
- A web browser
IBM Skills Network Labs will provide you with everything you need to complete this project. However, if you are serious about Data Science, you should give IBM Watson® Studio a try. IBM Watson® Studio empowers data scientists, developers, and analysts to build, run and manage AI models, and optimize decisions anywhere on IBM Cloud Pak® for Data. Unite teams, automate AI lifecycles, and speed time to value on an open multi-cloud architecture. Get started with the IBM Watson Studio free of charge.
There are no reviews yet.