Skip to main content
The 2024 Developer Survey results are live! See the results

Questions tagged [web-scraping]

Web scraping is the process of extracting specific information from websites that do not readily provide an API or other methods of automated data retrieval. Questions about "How To Get Started With Scraping" (e.g. with Excel VBA) should be *thoroughly researched* as numerous functional code samples are available. Web scraping methods include 3rd-party applications, development of custom software, or even manual data collection in a standardized way.

0 votes
0 answers
14 views

Html not loaded even after window.addEventListener('load', do_something) is invoked

I'm working on a browser extension that scrapes data from a webpage. I've added a method to scrape data, which is invoked via a load listener: window.addEventListener('load', scrapeSomething) However, ...
Tharun Reddy's user avatar
0 votes
0 answers
10 views

Can't scrape all the data from a lazy-loading table using Selenium

I'm trying to scrape three fields (player, logo, dkprice) from a table located in the middle of a webpage. To see all the data in that table, it is necessary to scroll down to the bottom of it. I've ...
SMTH's user avatar
  • 187
-1 votes
0 answers
22 views

Using Puphpeteer to scrape web page with no classes or ids [duplicate]

I'm helping a friend to migrate their blog from a "homemade" platform to Wordpress and the original developers have not been helpful in extracting the content so I'm trying to figure out how ...
Sanfly's user avatar
  • 1,019
0 votes
1 answer
9 views

Playwright Sync API inside the asyncio loop

So this is my first time with Playwright so I thought to try out the examples only to find none of the work and the errors dont make sense: I have tried all the examples in the docs and on the github ...
ArchaeonSeq's user avatar
0 votes
0 answers
21 views

Selenium Web Scraping on Google Maps: Clicking on all markers/points in view

I'm fairly new to Selenium and am looking to learn how to use it to click on all the markers or points on Google Maps in my view only. I’ve tried following some examples, but they either only work ...
i suck at programming's user avatar
0 votes
0 answers
8 views

How can I install html5lib on a dataproc cluster

I have a dataproc pipeline with which I do webscraping and store data in gcp. Task setting is something like this: create_dataproc_cluster = DataprocCreateClusterOperator( task_id='...
Sara 's user avatar
  • 55
0 votes
0 answers
13 views

What is the easiest way to save an captcha image from a .php page?

I've been trying to extract captcha images from a .php page but have had no luck so far, is there a simple(ish) way to do this? I've been trying with python with selenium so far and would like to keep ...
B G's user avatar
  • 1
-2 votes
0 answers
140 views

Scraping captcha from a website using selenium but the code won't produce an actual image

I'm trying to edit the code to save captchas in the hopes to eventually write a bot for automation. The following python code results in the subsequent error. import requests from selenium import ...
B G's user avatar
  • 1
0 votes
2 answers
48 views

findAll() returning empty outputs

I'm trying to scrape the title, date, rating and actual review of each reviews form mouthshut. But I'm unable to extract anything under the title of page. The review is in tag under class 'more ...
dooby's user avatar
  • 11
-1 votes
0 answers
29 views

bypass API key requirement

I'm trying to communicate to income tax portal (this website https://eportal.incometax.gov.in/iec/foservices/#/login) via Selenium-Python-Firefox and getting struck at this on opening API Key Required ...
Office's user avatar
  • 1
0 votes
0 answers
10 views

Extract leaflet marker coordinates by web scraping

I am trying to get the coordinates of a marker displayed on a leaflet map with my web scraper. However, I am unable to access these values. When exploring the page in my console, I stumbled on #...
ImRonny Dark's user avatar
0 votes
1 answer
41 views

Web scraping Images with beautiful soup Issue

I am trying to web scrape images of female buzzcuts and store it in a folder so that I could later use it to train a model. Yet, I am running into a problem where the code outputs "DONE", ...
Angel Rodriguez's user avatar
0 votes
0 answers
14 views

Instagram API endpoint to get ANY post details. In large scale

Looking for actual Instagram API endpoint (maybe private API) to get post details by post_id/short_code/media_id. I want to send many of requests hourly. Preferably without athentification. Was using ...
PATAPOsha's user avatar
  • 467
0 votes
0 answers
20 views

Selenium don't find class

I'm trying to obtain a value from Google Shopping and have attempted to use CSS, className, and XPath. However, nothing seems to work and it always returns an empty value. As you can see from the ...
Pablo Fernandes's user avatar
0 votes
1 answer
18 views

Extract stock market data from yahoo finance

I need to extract the symbols from the stocks that appear in this platform: https://finance.yahoo.com/markets/stocks/most-active/?start=0&count=100 I'm using R to achieve the aforthmentioned using ...
HMC's user avatar
  • 1

15 30 50 per page
1
2 3 4 5
3400