Tag: web-scraping

How to access data-state in div id, cheerio node js

javascript node.js parsing puppeteer web-scraping

I am grateful to everyone who participates and will help a newbie. Task: Access div = client-state, then div = Here-Goes-Some-Div-ID and return json from data-state. I managed to refer to div = client state like this – Unfortunately, I did not find information on how to further access the “Here-Go…

Puppeteer not retrieving JavaScript rendered page

javascript puppeteer web-scraping

I am trying to load the product page using puppeteer but its not working. If we open this URL it will load the page half and when we scroll down it loads rest of the page. I tried using the scroll as well but it did not work. Scroll function is following Answer When I run this headfully, I don’t

Mouse click event not working as expected

dom javascript web-scraping

I’m trying to search for a contact in Whatsapp Web search bar, first I want to focus in search bar and then enter the contact to execute the search. I’m selecting the div with: Dispatch event returns true but has no effect in search box Answer You can just use element.focus() or element.select(), …

Accessing Data from Javascript API call [closed]

api javascript python python-requests web-scraping

Closed. This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only by editing this post. Closed 8 months ago. Improve this question I am trying to access the data shown on this website: Link using either p…

Can I simplify this code to avoid the type error for reading properties?

javascript node.js puppeteer web-scraping

I am writing this code to scrape a webpage. I need to get specific information from the website and there is a lot of information needed to be scraped. The code that I write works but when do it repeatedly it encounters error on some of the line, e.g. line 20, line 24. Below is the code There are like

Puppeteer , bringing back blank array

javascript node.js puppeteer web web-scraping

I’m trying to grab products from ebay and open them on amazon. So far, I have them being searched on amazon but I’m struggling with getting the products selected from the search results. Currently its outputting a blank array and im not sure why. Have tested in a separate script without the grabTi…

Python Scraping JavaScript page without the need of an installed browser

javascript python selenium web-scraping

I’m trying to scrape an HTML element in a webpage. The content of this element are generated by Javascript and thus cannot be scraped by simply running a requests.GET: response = requests.get(url). I read in other posts that Selenium can be used to solve this issue, but it requires an actual browser ins…

Can’t Find Javascript href Link in Python Webscrape

javascript python selenium web-scraping

I’m trying to webscrape this site: https://www2.tse.or.jp/tseHpFront/JJK020010Action.do Using the Selenium package, with Google Chrome as my browser, I’m able to open it up, choose some settings, and then run a search. I’m encountering an error because there are 21 pages of information, and …

I’m trying to scrap data from a website and getting back basic HTML with JS function in the body

cheerio html javascript node.js web-scraping

Hi everyone, I’m playing around with Node.js and cheerio package as part of my node.js learning and im trying to build a web scrapper that will get the title and the price of an item from a shopping site but when I try to console.log the html variable it returns a basic html structure with some Js funct…

Puppeteer can’t find elements when Headless TRUE

headless-browser javascript puppeteer web-scraping

I’m facing some problems with Puppeteer, I want to extract a list of items and succeed when headless is FALSE but not when TRUE. First thing first, I want to get those elements before mapping on it. Here’s my script, maybe you can reproduce it, it is really basic. Answer For starters, I’d pr…