Crawlee v3.0.2 Release Notes

Release Date: 2022-07-28
    🛠 Fixes

    • regression in resolving the base URL for enqueueLinks filtering (1422)
    • 👌 improve file saving on memory storage (1421)
    • ➕ add UserData type argument to CheerioCrawlingContext and related interfaces (1424)
    • always limit desiredConcurrency to the value of maxConcurrency (bcb689d)
    • wait for storage to finish before resolving crawler.run() (9d62d56)
    • using explicitly typed router with CheerioCrawler (07b7e69)
    • 📦 declare dependency on ow in @crawlee/cheerio package (be59f99)
    • 👉 use crawlee@^3.0.0 in the CLI templates (6426f22)
    • 🛠 fix building projects with TS when puppeteer and playwright are not installed (1404)
    • enqueueLinks should respect full URL of the current request for relative link resolution (1427)
    • 0️⃣ use desiredConcurrency: 10 as the default for CheerioCrawler (1428)
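To illustrate why the base used for relative link resolution matters (fix 1427), here is a minimal sketch using the standard WHATWG URL class. The helper name resolveLink and the example URLs are hypothetical and are not part of Crawlee; the sketch only shows how the same relative href resolves differently against the full request URL and against the origin alone.

```typescript
// Hypothetical helper, not Crawlee's implementation: resolve an href found
// on a page against a given base URL using the built-in URL class.
function resolveLink(href: string, base: string): string {
    return new URL(href, base).href;
}

// Assumed example request URL (illustration only).
const requestUrl = 'https://example.com/docs/guide/intro?lang=en';

// Base is the full URL of the current request: the path is preserved.
const fromFullUrl = resolveLink('setup', requestUrl);

// Base is only the origin: the path of the current page is lost.
const fromOrigin = resolveLink('setup', new URL(requestUrl).origin);

console.log(fromFullUrl); // https://example.com/docs/guide/setup
console.log(fromOrigin);  // https://example.com/setup
```

The two results point at different pages, so a crawler that resolves relative links against anything less than the full request URL can enqueue the wrong targets.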

    🔋 Features

    • 🔧 allow configuring which status codes cause session retirement (1423)
    • 👍 add support for middlewares to the Router via the use method (1431)
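The middleware idea behind the Router's use method (1431) can be sketched with a small stand-alone class. This is not Crawlee's Router: MiniRouter, the Context shape, and the trace array are hypothetical, and the handlers are synchronous to keep the sketch short, whereas real crawler handlers are typically async. It assumes only that registered middlewares run, in order, before the handler matching the request label.

```typescript
// Hypothetical context: a route label plus a trace used to observe call order.
type Context = { label?: string; trace: string[] };
type Handler = (ctx: Context) => void;

// Minimal stand-in for a label-based router with middleware support.
class MiniRouter {
    private middlewares: Handler[] = [];
    private routes = new Map<string, Handler>();
    private defaultHandler?: Handler;

    // Register a middleware that runs before every route handler.
    use(middleware: Handler): void {
        this.middlewares.push(middleware);
    }

    addHandler(label: string, handler: Handler): void {
        this.routes.set(label, handler);
    }

    addDefaultHandler(handler: Handler): void {
        this.defaultHandler = handler;
    }

    // Run all middlewares in registration order, then the matching handler.
    handle(ctx: Context): void {
        for (const middleware of this.middlewares) {
            middleware(ctx);
        }
        const byLabel = ctx.label !== undefined ? this.routes.get(ctx.label) : undefined;
        const handler = byLabel ?? this.defaultHandler;
        if (!handler) {
            throw new Error(`No handler registered for label: ${ctx.label}`);
        }
        handler(ctx);
    }
}

const router = new MiniRouter();
router.use((ctx) => { ctx.trace.push('middleware'); });
router.addHandler('DETAIL', (ctx) => { ctx.trace.push('detail handler'); });
router.addDefaultHandler((ctx) => { ctx.trace.push('default handler'); });

const detail: Context = { label: 'DETAIL', trace: [] };
router.handle(detail);
console.log(detail.trace.join(' -> ')); // middleware -> detail handler
```

Keeping shared logic such as logging or request checks in a middleware avoids repeating it in every route handler.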