Web Crawling libraries
Showing projects tagged as Web Crawling
-
Crawlee
7.9 9.8 TypeScriptCrawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation. -
fastimage
1.6 7.1 L4 TypeScriptA module that finds the size and type of an image by fetching and reading as little data as needed.
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.