All Questions
Tagged with python web-scraping
29,754
questions
0
votes
0
answers
10
views
Python async_playwright
I'm writing a program that is supposed to play the streams from the ilovemusic website. That part works. The only problem is that when I call the Scraper class in main.py via scraper_task = ...
0
votes
1
answer
31
views
Can't scrape all the data from a lazy-loading table using Selenium
I'm trying to scrape three fields (player, logo, dkprice) from a table located in the middle of a webpage. To see all the data in that table, it is necessary to scroll down to the bottom of it.
I've ...
0
votes
1
answer
14
views
Playwright Sync API inside the asyncio loop
This is my first time using Playwright, so I decided to try out the examples, only to find that none of them work and the errors don't make sense:
I have tried all the examples in the docs and on GitHub ...
0
votes
0
answers
8
views
How can I install html5lib on a Dataproc cluster?
I have a Dataproc pipeline that I use for web scraping and for storing data in GCP.
The task setup looks something like this:
create_dataproc_cluster = DataprocCreateClusterOperator(
task_id='...
-2
votes
0
answers
166
views
Scraping captcha from a website using selenium but the code won't produce an actual image
I'm trying to edit the code to save captchas, in the hope of eventually writing a bot for automation. The following Python code results in the subsequent error.
import requests
from selenium import ...
0
votes
2
answers
50
views
findAll() returning empty outputs
I'm trying to scrape the title, date, rating, and actual review text of each review from mouthshut.
But I'm unable to extract anything below the page title.
The review is in a tag with class 'more ...
-2
votes
0
answers
30
views
Bypass API key requirement
I'm trying to communicate with the income tax portal (https://eportal.incometax.gov.in/iec/foservices/#/login) via Selenium, Python, and Firefox, and I'm getting stuck on this message when the page opens:
API Key Required
...
0
votes
1
answer
43
views
Issue web scraping images with Beautiful Soup
I am trying to web scrape images of female buzzcuts and store them in a folder so that I can later use them to train a model. Yet I am running into a problem where the code outputs "DONE", ...
0
votes
0
answers
21
views
Selenium doesn't find class
I'm trying to obtain a value from Google Shopping and have attempted to use CSS, className, and XPath.
However, nothing seems to work and it always returns an empty value. As you can see from the ...
-2
votes
1
answer
49
views
Trying to find out the logic of this page: approx. 100+ results stored and parsed with Python & BS4
I'm trying to find out the logic behind this page:
We have stored some results in the following DB:
https://www.raiffeisen.ch/rch/de/ueber-uns/raiffeisen-gruppe/organisation/raiffeisenbanken/...
0
votes
1
answer
30
views
Trying to apply a BS4 approach to a Wikipedia page: results are not stored in a df
Scraping Wikipedia is a very common technique, and an appropriate approach can be reused for many different jobs, but I had some issues with getting back ...
0
votes
0
answers
22
views
Selenium unable to click on element
I am working on a web scraping project using Selenium in Python and am encountering an issue while interacting with the Lufthansa homepage. My goal is to click a specific element (the Departure field) ...
-3
votes
0
answers
26
views
Extracting CEO Information [closed]
I am working on a project in which I have to extract CEO information (company, name, and tenure) for the last 25 years across the US and save it in CSV format for further ...
0
votes
1
answer
43
views
Why is my Selenium script only scraping data for 7 matches on F O R E B E T?
I'm working on a web scraping project using Selenium to scrape football match data from a sports prediction website (let's use EXAMPLE for F O R E B E T). However, my script only retrieves data for 7 ...
0
votes
0
answers
82
views
How to Resolve Verify Traffic Error When Scraping Data from Shopee?
I'm currently working on a project where I need to scrape product data from Shopee. I'm using Python with the requests and BeautifulSoup libraries. However, I keep encountering a Verify Traffic error ...