docs: Add Scrapling guide by vdusek · Pull Request #938 · apify/apify-sdk-python

vdusek · 2026-06-05T08:40:56Z

Adds a guide for the Scrapling adaptive web scraping library in Apify Actors, following the structure of the existing scraping-library guides.

- docs/03_guides/07_scrapling.mdx: the guide, covering an introduction and features, choosing a fetcher (HTTP vs. browser-based), a runnable example Actor, Apify Proxy integration, and running the browser fetchers (DynamicFetcher/StealthyFetcher), including the required `scrapling install` step in the Dockerfile.
- code/07_scrapling.py: a runnable single-file example, a recursive title scraper that uses Scrapling's async HTTP fetcher (AsyncFetcher) through Apify Proxy.
- code/07_scrapling_browser.py: the browser-based variant.
- The quick-start guides list is updated.
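
For orientation, the recursive title scraper described above can be approximated without Scrapling or the Apify SDK. The sketch below is a stand-in built on assumptions: it uses only the Python standard library, replaces the HTTP fetcher with a hypothetical `fetch` callable backed by an in-memory `FAKE_SITE`, and tracks depth in a local queue instead of the Actor's request queue. It is not the code from `code/07_scrapling.py`.

```python
from collections import deque
from html.parser import HTMLParser
from typing import Callable
from urllib.parse import urljoin

# Hypothetical in-memory site standing in for real HTTP responses.
FAKE_SITE = {
    "https://example.com/": '<title>Home</title><a href="/a">A</a><a href="/b">B</a>',
    "https://example.com/a": '<title>Page A</title><a href="/">Home</a><a href="/c">C</a>',
    "https://example.com/b": "<title>Page B</title>",
    "https://example.com/c": "<title>Page C</title>",
}


class TitleAndLinks(HTMLParser):
    """Collect the <title> text and every <a href> found on a page."""

    def __init__(self) -> None:
        super().__init__()
        self.title = ""
        self.links: list[str] = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def crawl_titles(start_url: str, fetch: Callable[[str], str], max_depth: int) -> list[dict]:
    """Breadth-first crawl that records page titles down to `max_depth`."""
    seen = {start_url}
    queue = deque([(start_url, 0)])
    results = []
    while queue:
        url, depth = queue.popleft()
        parser = TitleAndLinks()
        parser.feed(fetch(url))
        parser.close()
        results.append({"url": url, "title": parser.title.strip()})
        if depth >= max_depth:
            continue  # links on pages at the depth limit are not followed
        for href in parser.links:
            link = urljoin(url, href)  # resolve relative links against the page URL
            if link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return results


results = crawl_titles("https://example.com/", FAKE_SITE.__getitem__, max_depth=1)
```

In the real example, `fetch` would presumably be replaced by an awaited call to Scrapling's fetcher and the local queue by the Actor's request queue; both substitutions are assumptions about how the pieces map, not a description of the merged code.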

Verified locally (apify run) and on the Apify platform (build + run SUCCEEDED, correct dataset output via Apify Proxy), including the browser path. Lint + type-check pass.

Closes: #836

TODO before merging

Clone the guide content (docs/03_guides/07_scrapling.mdx + docs/03_guides/code/07_scrapling.py + docs/03_guides/code/07_scrapling_browser.py) into website/versioned_docs/version-3.4/ so it also shows in the current docs version, not only under "next".

codecov · 2026-06-05T08:44:41Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 86.95%. Comparing base (24a6edb) to head (55ad62a).
⚠️ Report is 6 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #938   +/-   ##
=======================================
  Coverage   86.94%   86.95%           
=======================================
  Files          48       48           
  Lines        2942     2943    +1     
=======================================
+ Hits         2558     2559    +1     
  Misses        384      384
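
The percentages in the diff above can be reproduced from the Hits and Lines rows. The sketch below assumes the report truncates to two decimals instead of rounding; that is an assumption chosen because it reproduces the 86.94% base figure (plain rounding would give 86.95%). The helper name `coverage_percent` is hypothetical.

```python
import math


def coverage_percent(hits: int, lines: int) -> float:
    """Coverage as a percentage, truncated (not rounded) to two decimals."""
    return math.floor(hits / lines * 10_000) / 100


# Figures taken from the Hits and Lines rows of the coverage diff.
base = coverage_percent(2558, 2942)
head = coverage_percent(2559, 2943)

# Sanity check: hits plus misses should equal total lines on both sides.
base_consistent = 2558 + 384 == 2942
head_consistent = 2559 + 384 == 2943

print(base, head)
```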

| Flag | Coverage Δ | |
|---|---|---|
| e2e | `?` | |
| integration | `59.08% <ø> (+0.18%)` | ⬆️ |
| unit | `75.70% <ø> (+<0.01%)` | ⬆️ |

Flags with carried forward coverage won't be shown.

Mantisus

A couple of questions that might change the guide

Mantisus · 2026-06-07T19:13:54Z

+) -> tuple[dict[str, Any], list[str]]:
+    """Fetch a page in a real browser with Scrapling and return data and links."""
+    # `network_idle` waits until the page stops making network requests.
+    response = await DynamicFetcher.async_fetch(


How does this work internally? Does the browser open, send a request, and then close? If so, that looks like a lot of overhead for a guide example.
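
The thread does not answer this question, and the sketch below does not describe Scrapling's internals. It only illustrates the concern with a hypothetical `FakeBrowser` class that counts launches: a one-shot pattern starts a browser for every URL, while a shared pattern starts it once and reuses it. All names here are invented for illustration.

```python
import asyncio


class FakeBrowser:
    """Hypothetical stand-in for a browser; it only counts how often it is launched."""

    launches = 0

    async def __aenter__(self) -> "FakeBrowser":
        FakeBrowser.launches += 1  # a real launch would cost time and memory here
        return self

    async def __aexit__(self, *exc_info) -> None:
        return None

    async def fetch(self, url: str) -> str:
        return f"<html><title>{url}</title></html>"


async def fetch_one_shot(urls: list[str]) -> list[str]:
    """Open and close a browser for every single URL."""
    pages = []
    for url in urls:
        async with FakeBrowser() as browser:
            pages.append(await browser.fetch(url))
    return pages


async def fetch_shared(urls: list[str]) -> list[str]:
    """Open one browser and reuse it for all URLs."""
    async with FakeBrowser() as browser:
        return [await browser.fetch(url) for url in urls]


def count_launches(strategy, urls: list[str]) -> int:
    """Run a strategy and report how many browser launches it needed."""
    FakeBrowser.launches = 0
    asyncio.run(strategy(urls))
    return FakeBrowser.launches


urls = ["https://example.com/1", "https://example.com/2", "https://example.com/3"]
one_shot_launches = count_launches(fetch_one_shot, urls)
shared_launches = count_launches(fetch_shared, urls)
```

If the one-shot fetcher does behave like `fetch_one_shot`, a session-style API that keeps the browser open across requests would be the usual way to avoid the repeated startup cost.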

Mantisus · 2026-06-07T19:18:12Z

+) -> tuple[dict[str, Any], list[str]]:
+    """Fetch a page with Scrapling's HTTP fetcher and return data and links."""
+    # `impersonate` and `stealthy_headers` make the request look like Chrome.
+    response = await AsyncFetcher.get(


The guide is titled 'Adaptive Scraping with Scrapling'. Should we use the 'adaptive=True' mode in the example? 🙂

https://scrapling.readthedocs.io/en/latest/parsing/adaptive.html

Mantisus · 2026-06-07T19:32:41Z

+    {ScraplingBrowserScraper}
+</CodeBlock>
+
+To run this on the Apify platform, build on top of the [Apify Playwright base image](https://hub.docker.com/r/apify/actor-python-playwright), which already ships a browser together with all of its system-level dependencies, and run `scrapling install` during the Docker build to download the browser binaries that Scrapling expects.


We can add a simple Dockerfile example here.

docs: Add Scrapling guide

54e153d

vdusek added the adhoc (ad-hoc unplanned task added during the sprint) and t-tooling (issues owned by the tooling team) labels Jun 5, 2026

vdusek self-assigned this Jun 5, 2026

github-actions Bot added this to the 142nd sprint - Tooling team milestone Jun 5, 2026

docs: Split Scrapling guide example into modules and use code tabs

29c4c8a

vdusek marked this pull request as ready for review June 5, 2026 09:54

vdusek requested a review from szaganek as a code owner June 5, 2026 09:54

vdusek requested a review from Mantisus June 5, 2026 09:54

docs: use Request.crawl_depth for depth tracking in Scrapling example

2a41a3f

This was referenced Jun 5, 2026

docs: Add guide on validating Actor input with Pydantic #941

Open

docs: Add Crawl4AI guide #942

Open

docs: Add Browser Use guide #943

Open

vdusek added 2 commits June 5, 2026 20:45

docs: renumber Scrapling guide to 07 and switch to a single-file example

910df14

chore: drop unused ruff ignore for the removed Scrapling project

404bdfb

Mantisus suggested changes Jun 7, 2026

docs: reduce clause-gluing dashes in the Scrapling guide

55ad62a

vdusek mentioned this pull request Jun 8, 2026

docs: Retitle and streamline the existing guides #939

Open

1 task

vdusek wants to merge 6 commits into master from docs/scrapling-guide

3 participants
