
How to Web Scrape Stock Market Data With Python (Easy Guide)


Whether you’re an investor tracking your financial portfolio performance or an investment firm looking for new investment opportunities, web scraping stock market data lets you access the latest financial information and trends to support your needs.

In this web scraping tutorial, we’ll show you how to build a web scraping tool to extract stock market data using Python and BeautifulSoup. Additionally, the example will teach you how to track and extract multiple stock prices and organize them in a CSV file for further analysis. Let’s jump right in.

How to Build a Stock Market Data Scraper 

For this exercise, we’ll be scraping investing.com to extract up-to-date stock prices for Microsoft, Coca-Cola, and Nike and store them in a CSV file. We’ll also show you how to use ScraperAPI to protect your web scraping bot from being blocked by anti-scraping mechanisms.

Note: The script will scrape stock market data even without ScraperAPI, but ScraperAPI will be crucial for scaling your project later.

Although we’ll be walking you through every step of the stock market data extraction process, having some knowledge of the Beautiful Soup library beforehand is helpful. If you’re totally new to this library, check out our beautiful soup tutorial for beginners. It’s packed with tips and tricks, and goes over the basics you need to know to scrape almost anything.

With that out of the way, let’s jump into the code so you can learn how to scrape stock market data.

1. Setting Up Our Stock Market Web Scraping Project

To begin, we’ll create a folder named “scraper-stock-project” and open it in VS Code (you can use any text editor you’d like). Next, we’ll open a new terminal and install our two main dependencies for this project:

  • pip3 install bs4
  • pip3 install requests

After that, we’ll create a new file named “stockData-scraper.py” and import our dependencies to it.

<pre>import requests
from bs4 import BeautifulSoup</pre>

With Requests, we’ll be able to send an HTTP request to download the page’s HTML, which we’ll then pass to BeautifulSoup for parsing. Let’s test it by sending a request to Nike’s stock page:

<pre>url = 'https://www.investing.com/equities/nike'
page = requests.get(url)

print(page.status_code)</pre>

By printing the status code of the page variable (which holds the server’s response), we’ll know whether or not we can scrape the page. The code we’re looking for is 200, meaning the request was successful.

Stock Market Scraper

Success! Before moving on, we’ll pass the response stored in page to Beautiful Soup for parsing:

<pre>soup = BeautifulSoup(page.text, 'html.parser')</pre>

You can use any parser you want, but we’re going with html.parser because it ships with Python’s standard library, so there’s nothing extra to install.
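Putting the pieces together so far, here’s a minimal sketch that also guards against failed requests using Requests’ built-in raise_for_status() method, which raises an exception for any 4xx or 5xx response:

<pre>import requests
from bs4 import BeautifulSoup

url = 'https://www.investing.com/equities/nike'
page = requests.get(url)
page.raise_for_status()  # raises an HTTPError for any 4xx/5xx response

soup = BeautifulSoup(page.text, 'html.parser')</pre>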

Related Resource: What is Data Parsing in Web Scraping? [Code Snippets Inside]

2. Inspect the Website’s HTML Structure (Investing.com)

Before we start scraping, let’s open https://www.investing.com/equities/nike in our browser to get more familiar with the website.

Website’s HTML Structure

As you can see in the screenshot above, the page displays the company’s name, stock symbol, price, and price change. At this point, we have three questions to answer:

  1. Is the data being injected with JavaScript?
  2. What attribute can we use to select the elements?
  3. Are these attributes consistent throughout all pages?

Check for JavaScript

There are several ways to verify whether a script is injecting a piece of data, but the easiest is to right-click the page and select View Page Source.

JavaScript

It looks like there isn’t any JavaScript that could interfere with our scraper. Next, we’ll do the same check for the rest of the information. Since we didn’t find any additional JavaScript, we’re good to go.

Note: Checking for JavaScript is important because Requests can’t execute JavaScript or interact with the website, so if the information is behind a script, we would have to use other tools to extract it, like Selenium.
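If we had found script-injected data, here’s a minimal sketch of what that Selenium fallback could look like (assuming Selenium 4+ and a local Chrome install; it’s not needed for this tutorial):

<pre>from bs4 import BeautifulSoup
from selenium import webdriver

driver = webdriver.Chrome()  # Selenium 4+ fetches a matching driver automatically
driver.get('https://www.investing.com/equities/nike')
html = driver.page_source    # the HTML after JavaScript has executed
driver.quit()

soup = BeautifulSoup(html, 'html.parser')</pre>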

Picking the CSS Selectors

Now let’s inspect the HTML of the site to identify the attributes we can use to select the elements.

Picking the CSS Selectors

Extracting the company’s name and the stock symbol will be a breeze. We just need to target the H1 tag with class text-2xl font-semibold instrument-header_title__GTWDv mobile:mb-2.

However, the price, price change, and percentage change are separated into different spans.

Picking the CSS Selectors

What’s more, depending on whether the change is positive or negative, the class of the element changes, so even if we select each span by its class attribute, there will still be instances where it won’t work.

The good news is that there’s a little trick to get around this. Because Beautiful Soup returns a parsed tree, we can navigate the tree and pick the element we want, even though we don’t have the exact CSS class.

What we’ll do in this scenario is go up in the hierarchy and find a parent div we can exploit. Then we can use find_all('span') to make a list of all the elements containing the span tag – which we know our target data uses. And because it’s a list, we can now easily navigate it and pick those we need.

So here are our targets:

<pre>company = soup.find('h1', {'class': 'text-2xl font-semibold instrument-header_title__GTWDv mobile:mb-2'}).text
price = soup.find('div', {'class': 'instrument-price_instrument-price__3uw25 flex items-end flex-wrap font-bold'}).find_all('span')[0].text
change = soup.find('div', {'class': 'instrument-price_instrument-price__3uw25 flex items-end flex-wrap font-bold'}).find_all('span')[2].text</pre>

Now for a test run:

<pre>print('Loading:', url)
print(company, price, change)</pre>

And here’s the result:

Picking the CSS Selectors

3. Scrape Multiple Financial Stock Data

Now that our parser is working, let’s scale this up and scrape several stocks. After all, a script that tracks just a single stock isn’t likely to be very useful.

We can make our scraper parse and scrape several pages by creating a list of URLs and looping through them to output the data.

<pre>urls = [
    'https://www.investing.com/equities/nike',
    'https://www.investing.com/equities/coca-cola-co',
    'https://www.investing.com/equities/microsoft-corp',
]

for url in urls:
    page = requests.get(url)
    soup = BeautifulSoup(page.text, 'html.parser')

    company = soup.find('h1', {'class': 'text-2xl font-semibold instrument-header_title__GTWDv mobile:mb-2'}).text
    price = soup.find('div', {'class': 'instrument-price_instrument-price__3uw25 flex items-end flex-wrap font-bold'}).find_all('span')[0].text
    change = soup.find('div', {'class': 'instrument-price_instrument-price__3uw25 flex items-end flex-wrap font-bold'}).find_all('span')[2].text

    print('Loading:', url)
    print(company, price, change)</pre>

Here’s the result after running it:

Scrape Multiple Stocks

Awesome, it works across the board!

We can keep adding more and more pages to the list, but eventually we’ll hit a big roadblock: anti-scraping techniques.

4. Integrating ScraperAPI’s Web Scraping Tools to Handle IP Rotation and CAPTCHAs

Not every website likes to be scraped, and for good reason. When scraping a website, we need to keep in mind that we’re sending traffic to it; if we’re not careful, we could limit the bandwidth the website has for real visitors, or even increase hosting costs for the owner. That said, as long as we respect web scraping best practices, we won’t have any problems with our projects, and we won’t cause the sites we’re scraping any issues.

However, it’s hard for businesses to differentiate between ethical scrapers and those that will break their sites. For this reason, most servers are equipped with different systems, such as:

  • Browser behavior profiling
  • CAPTCHAs
  • Monitoring the number of requests from an IP address in a time period

These measures are designed to recognize bots and block them from accessing the website for days, weeks, or even permanently.
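Even before reaching for a dedicated tool, a simple courtesy while testing is to pause between requests so you don’t flood the server. Here’s a minimal sketch using Python’s time module:

<pre>import time
import requests

urls = ['https://www.investing.com/equities/nike']

for url in urls:
    page = requests.get(url)
    # ...parse the page here...
    time.sleep(2)  # wait two seconds between requests</pre>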

Instead of handling all of these scenarios individually, we’ll just add two lines of code to make our requests go through ScraperAPI’s servers and get everything automated for us.

First, let’s create a free ScraperAPI account to access our API key and 5000 free API credits for our project.

API Key

Now we’re ready to add a new params variable to our loop to store our API key and target URL, and to use urlencode to construct the request URL we’ll send inside the page variable.

<pre>params = {'api_key': 'YOUR_API_KEY', 'url': url}
page = requests.get('http://api.scraperapi.com/', params=urlencode(params))</pre>

Oh! And we can’t forget to add our new dependency to the top of the file:

<pre>from urllib.parse import urlencode</pre>

Every request will now be sent through ScraperAPI, which will automatically rotate our IP after every request, handle CAPTCHAs, and use machine learning and statistical analysis to set the best headers to ensure success.

Quick Tip: ScraperAPI also allows us to scrape dynamic sites by setting 'render': 'true' as a parameter in our params variable. ScraperAPI will then render the page before sending back the response.
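For instance, here’s a sketch of the same request with rendering enabled, based on the parameter named in the tip above:

<pre>from urllib.parse import urlencode
import requests

url = 'https://www.investing.com/equities/nike'

# same request as before, but with rendering enabled so ScraperAPI
# executes the page's JavaScript before returning the HTML
params = {'api_key': 'YOUR_API_KEY', 'url': url, 'render': 'true'}
page = requests.get('http://api.scraperapi.com/', params=urlencode(params))</pre>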

5. Store The Extracted Financial Data In a CSV File

To store your data in an easy-to-use CSV file, add import csv to your dependencies and then these three lines between your URL list and your loop:

<pre>file = open('stockprices.csv', 'w', newline='', encoding='utf-8')
writer = csv.writer(file)
writer.writerow(['Company', 'Price', 'Change'])</pre>

This will create a new CSV file (opened with newline='' and UTF-8 encoding, which is how Python’s csv module expects files to be opened) and pass it to our writer to add the first row with our headers.

It’s essential to add it outside of the loop, or it will rewrite the file after scraping each page, basically erasing previous data and giving us a CSV file with only the data from the last URL from our list.

In addition, we’ll need to add another line to our loop to write the scraped data:

<pre>writer.writerow([company, price, change])  # plain strings, since the file is already UTF-8</pre>

And one more outside the loop to close the file:

<pre>file.close()</pre>
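As a side note, an equally valid and arguably more idiomatic pattern is a with block, which closes the file automatically even if the script raises an error partway through:

<pre>import csv

with open('stockprices.csv', 'w', newline='', encoding='utf-8') as file:
    writer = csv.writer(file)
    writer.writerow(['Company', 'Price', 'Change'])
    # ...the scraping loop and its writer.writerow() calls go here...</pre>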

6. Finished Code: Stock Market Data Web Scraper Script

You’ve made it! You can now use this script with your own API key and add as many stocks as you want to scrape:

<pre># dependencies
import requests
from bs4 import BeautifulSoup
import csv
from urllib.parse import urlencode

# list of URLs
urls = [
    'https://www.investing.com/equities/nike',
    'https://www.investing.com/equities/coca-cola-co',
    'https://www.investing.com/equities/microsoft-corp',
]

# starting our CSV file
file = open('stockprices.csv', 'w', newline='', encoding='utf-8')
writer = csv.writer(file)
writer.writerow(['Company', 'Price', 'Change'])

# looping through our list
for url in urls:

    # sending our request through ScraperAPI
    params = {'api_key': 'YOUR_API_KEY', 'url': url}
    page = requests.get('http://api.scraperapi.com/', params=urlencode(params))

    # our parser
    soup = BeautifulSoup(page.text, 'html.parser')
    company = soup.find('h1', {'class': 'text-2xl font-semibold instrument-header_title__GTWDv mobile:mb-2'}).text
    price = soup.find('div', {'class': 'instrument-price_instrument-price__3uw25 flex items-end flex-wrap font-bold'}).find_all('span')[0].text
    change = soup.find('div', {'class': 'instrument-price_instrument-price__3uw25 flex items-end flex-wrap font-bold'}).find_all('span')[2].text

    # printing to have some visual feedback
    print('Loading:', url)
    print(company, price, change)

    # writing the data into our CSV file
    writer.writerow([company, price, change])

file.close()</pre>

Consider When to Run Your Stock Market Data Scraper

You need to remember that the stock market isn’t always open. For example, the New York Stock Exchange (NYSE) closes at 4 pm ET on Friday and doesn’t open again until Monday at 9:30 am, so there’s no point in running your scraper over the weekend. In fact, it closes at 4 pm every trading day, so you won’t see any changes in the price after that.

New York Stock Exchange

Another variable to keep in mind is how often you need to update the data. The most volatile times for the stock exchange are opening and closing times. So it might be enough to run your script at 9:30 am, at 11 am, and at 4:30 pm to see how the stocks closed. Monday’s opening is also crucial to monitor as many trades occur during this time.
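If you’d rather handle timing in Python than with an external scheduler like cron, the third-party schedule package (our suggestion here, not something this tutorial requires) makes time-based runs straightforward:

<pre>import time
import schedule

def scrape_stocks():
    # call the scraper from step 6 here
    print('Running the stock scraper...')

# run at the open, mid-morning, and after the close (server local time)
schedule.every().day.at('09:30').do(scrape_stocks)
schedule.every().day.at('11:00').do(scrape_stocks)
schedule.every().day.at('16:30').do(scrape_stocks)

while True:
    schedule.run_pending()
    time.sleep(60)  # check the schedule once a minute</pre>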

Unlike other markets like Forex, the stock market typically doesn’t make too many crazy swings. That said, news and business decisions can heavily impact stock prices (take the crash of Meta’s share price or the rise of GameStop’s as examples), so reading the news related to the stocks you’re scraping is vital.

Easily Schedule Web Scraping Stock Market Data with ScraperAPI’s DataPipeline

We hope this tutorial helped you build your own stock market data scraper, or at least pointed you in the right direction.

If you’re looking for an automated data scraping solution, ScraperAPI’s DataPipeline is a great option. It makes scheduling your stock market data scraping projects easy, so you don’t have to worry about different time zones. Simply set your desired scheduling time, and ScraperAPI’s web scraping tool will automatically run, delivering the latest financial data in structured JSON format right to you. Sign up here to start with a 7-day free trial.

Until next time, happy scraping!

About the author


Zoltan Bettenbuk

Zoltan Bettenbuk is the CTO of ScraperAPI - helping thousands of companies get access to the data they need. He’s a well-known expert in data processing and web scraping. With more than 15 years of experience in software development, product management, and leadership, Zoltan frequently publishes his insights on our blog as well as on Twitter and LinkedIn.
