
Should I choose Apify Starter or Scale if I need multiple concurrent runs and daily scheduling?
When you’re planning multiple concurrent Actor runs plus daily scheduling, the choice between Apify Starter and Scale comes down to how much concurrency, volume, and headroom you need today—and how often you want to think about limits.
Quick Answer: If you just need a few concurrent runs and light daily schedules for internal or prototype workloads, Apify Starter is usually enough. If you’re running production pipelines, multiple scheduled Actors, or anything customer-facing, go with Scale so concurrency and usage spikes don’t become your bottleneck.
The Quick Overview
- What It Is: A comparison of Apify’s Starter vs Scale plans for workloads that require multiple concurrent runs and scheduled daily jobs.
- Who It Is For: Engineers, data teams, and product builders who already know they’ll run more than one Actor at a time and want reliable daily (or more frequent) schedules.
- Core Problem Solved: Picking a plan that won’t throttle your crawlers, break your daily data pipelines, or force you into constant quota micromanagement.
How It Works
On Apify, the core unit is an Actor: a deployable web scraping or automation app you can run, schedule, and integrate via API. Every time you trigger an Actor (manually, via API, or via a schedule), you create a run that consumes platform resources (compute units, storage, proxies, unblocking, etc.).
Plans differ by:
- How many runs you can execute concurrently.
- How much usage you get (credits/compute).
- What happens when you hit those limits.
For a workload that needs multiple concurrent runs and reliable daily scheduling, you want a plan that:
- Allows several runs in parallel without queueing for hours.
- Lets you define multiple schedules (e.g., daily, hourly, or per region).
- Handles occasional spikes (more pages than usual, extra runs, retries) without erroring out or stalling runs.
At a high level, the decision process looks like this:
- Estimate your baseline load: How many Actors, how many schedules, how often, and average run duration.
- Translate that into concurrency needs: How many of those runs overlap in time during your “busy window.”
- Pick the plan that comfortably covers the peak: Not just the average, so a slightly heavier day doesn’t take the whole pipeline down.
Below I’ll break down how to think through this in terms of Starter vs Scale.
How Starter vs Scale behaves for concurrent runs and scheduling
Because plan details can change over time, treat this as a decision framework rather than hard numbers. Always cross-check the exact limits on the current Apify pricing page.
Typical “Starter” profile:
- Designed for individuals, prototypes, and light workloads.
- Enough concurrency for:
- A handful of Actors.
- A couple of daily schedules.
- On-demand test runs during the workday.
- Will start to feel constrained when:
- You want several Actors all running at the same time (e.g., multiple sites, or multiple region variants).
- You run jobs with longer runtimes (e.g., full-site crawls, large social graphs).
- You need to stack scheduled runs (daily + hourly + ad-hoc debugging).
Typical “Scale” profile:
- Designed for teams and production loads.
- Concurrency and quotas sized for:
- Multiple scheduled Actors per environment (prod/staging).
- High-frequency schedules (e.g., hourly) plus daily full refreshes.
- Parallelism across data sources (e.g., Google Maps + TikTok + e-commerce).
- Much more forgiving with spikes:
- If one day your sites double in size, or retries increase due to blocking, you’re much less likely to hit hard limits.
From experience moving price-intelligence crawlers from a homegrown stack to Apify:
- Starter works well when your Actor runs are short (minutes) and you can serialize most of them.
- As soon as you have:
- multiple stakeholders,
- multiple external consumers of the data (e.g., dashboards, LLM pipelines),
- or SLAs attached to “data must be fresh by 8:00 AM,” you want Scale so concurrency isn’t a constant firefight.
Features & Benefits Breakdown
Here’s how the decision usually plays out in practice.
| Core Feature | What It Does | Primary Benefit |
|---|---|---|
| Concurrent Actor runs | Defines how many Actor runs can execute in parallel before new runs are queued. | Ensures your daily schedules and API-triggered runs don’t block each other. |
| Flexible scheduling | Lets you schedule Actors to run daily, hourly, or at custom CRON expressions. | Keeps your data pipelines reliable without manual intervention. |
| Usage headroom for spikes | Provides enough credits/compute for unexpected peaks (extra pages, retries, new Actors). | Prevents failed runs and missed SLAs when sites change or traffic grows. |
On both Starter and Scale you still get the full Apify platform:
- Actors: Pre-built in the Apify Store or your own custom ones.
- Infra handled: Proxies, unblocking, cloud deployment, monitoring, and data processing handled by Apify.
- Datasets & integration: Export JSON/CSV/Excel, or consume via Apify API, Python/JS SDKs, webhooks, or tools like Zapier, Google Sheets, Airbyte, Slack, and even MCP clients.
The difference is how comfortably your concurrency and scheduling fit inside the plan.
Ideal Use Cases
-
Best for “just testing” or narrow internal tools: Starter
Because it can comfortably handle:- 1–3 main Actors.
- One or two daily schedules.
- Occasional manual/API-triggered runs during the day.
Example: a small internal dashboard that refreshes competitor pricing once per day, plus a couple of manual runs when product managers ask for “extra checks.”
-
Best for production pipelines and multiple data sources: Scale
Because it supports:- Several Actors running concurrently across multiple sites/regions.
- Multiple daily or hourly schedules that overlap.
- Continuous integration with downstream tools (BI, vector databases, RAG apps).
Example: a pipeline that: - Crawls 5–10 e‑commerce sites,
- Updates Google Maps leads daily,
- Refreshes content via Website Content Crawler to feed a vector DB like Pinecone for your RAG pipeline, all running overnight and throughout the day without stepping on each other.
Limitations & Considerations
-
Starter’s concurrency can become a bottleneck quickly:
If your daily run window is tight (e.g., you want all data updated between 02:00–04:00) and your jobs are longer-running, a lower concurrency limit means some jobs may still be queued when your business day starts. The workaround is to:- Stagger schedules,
- Shorten individual runs (e.g., partial crawls),
- Or upgrade to Scale.
-
Scale is the safer choice for “unknown but likely to grow” workloads:
If you’re not sure about your final scale but you know:- More sources will be added,
- More teams will depend on the data,
- Or you plan to expose this data to customers or AI features,
starting on Starter often leads to an upgrade later, sometimes after painful run failures. In those cases, it’s more efficient to start on Scale and avoid redesigning your scheduling patterns.
Pricing & Plans
Apify’s exact quotas and pricing can change, so always verify on the official pricing page. Conceptually:
- Starter: Lower monthly cost, lower concurrency and usage limits. Best for individuals, PoCs, small internal tools, and low-frequency schedules.
- Scale: Higher monthly cost, higher concurrency and usage limits, plus more breathing room for spikes. Best for teams, production pipelines, and multi-source data projects.
Rule-of-thumb from running production crawlers:
-
If hitting a limit would block your team or break a user-facing feature, treat that workload as Scale territory.
-
If limits are just an occasional annoyance and your runs are not time-critical, Starter is likely fine.
-
Starter: Best for solo builders or small teams needing a few daily schedules and occasional concurrent runs, while staying budget-conscious.
-
Scale: Best for data and product teams needing consistent concurrency for multiple Actors, overlapping schedules, and production-grade reliability.
Frequently Asked Questions
Can I run multiple Actors at the same time on Starter?
Short Answer: Yes, but with tighter concurrency limits than Scale.
Details:
On Starter, you can absolutely have more than one Actor running at the same time, especially if they’re short-lived or scheduled at different times of day. The practical limit is how many runs can be active before new ones are queued. If you have:
- Several daily schedules that all fire around the same time.
- On-demand runs triggered via API.
- Longer-running crawls (full-site scans, large social scrapes).
then you’ll start to feel the concurrency ceiling. If you see runs frequently stuck in READY/queued state for long periods, or your scheduled jobs finish later than you want, that’s a strong signal you’ve outgrown Starter and should move to Scale.
I only need one daily schedule now, but expect more in a few months. Should I start on Starter or Scale?
Short Answer: If your current pipeline isn’t business-critical yet, start on Starter; if this data will feed production features or AI workflows soon, start on Scale.
Details:
If you’re still experimenting—trying out Actors from the Apify Store, validating one or two data sources, or prototyping an LLM/RAG pipeline—Starter is a sensible landing spot. You can:
- Schedule one or two key Actors daily.
- Manually or programmatically run others as needed.
- Understand your real runtime patterns and concurrency needs.
But if the roadmap clearly includes:
- Multiple sources,
- Multiple daily/hourly schedules,
- Integration with production systems (e.g., BI dashboards, internal apps, or AI agents),
then starting directly on Scale saves you from reworking schedules and dealing with unexpected limits later, right when stakeholders start depending on the data.
Summary
If your primary requirement is “multiple concurrent runs and daily scheduling,” you’re already on the boundary where Starter may work technically, but Scale will work operationally.
Use this rule-of-thumb:
-
Choose Starter if:
- You have a small number of Actors,
- One or two daily schedules,
- No hard deadlines for when data must be ready,
- And you’re still validating your use case.
-
Choose Scale if:
- You run several Actors in parallel,
- Have overlapping schedules (daily + hourly + per-region),
- Or your data feeds production systems, dashboards, or AI/RAG workflows that need predictable freshness.
If you’re on the fence, you’re usually closer to needing Scale than you think—especially once more teams discover your dataset.