Skip to main content
GET
/
dca
/
dataset
cURL
curl --request GET \
  --url https://api.brightdata.com/dca/dataset \
  --header 'Authorization: Bearer <token>'
[
  {
    "Image": "https://targetwebsite.com/product_id.png",
    "Title": "product_name",
    "Price": "product_price",
    "input": {
      "url": "https://targetwebsite.com/product_id/"
    }
  }
]

Documentation Index

Fetch the complete documentation index at: https://docs.brightdata.com/llms.txt

Use this file to discover all available pages before exploring further.

Download the dataset produced by a triggered Bright Data Scraper Studio collection. The same endpoint serves both the in-progress response (a status object) and the ready response (a JSON array of records). For the polling pattern in cURL, Python and Node.js, see the Quickstart.
Result data is available for download for 16 days after collection. To avoid expiration, download the data within 16 days or configure a push delivery method to send it to your storage automatically.

Request

curl "https://api.brightdata.com/dca/dataset?id=$COLLECTION_ID" \
  -H "Authorization: Bearer $BRIGHT_DATA_API_TOKEN"

Response

While the collection is still building (HTTP 202):
{
  "status": "building",
  "message": "Dataset is not ready yet, try again in XXs"
}
When the collection is ready (HTTP 200):
[
  {
    "url": "https://www.dm.de/p/d/3133774/babylove-teller-silikon-mit-trennschale-regenbogen-orange-creme",
    "title": "babylove Teller Silikon mit Trennschale Regenbogen orange/creme",
    "price": 8.45,
    "availability": "in stock",
    "input": {
      "url": "https://www.dm.de/p/d/3133774/babylove-teller-silikon-mit-trennschale-regenbogen-orange-creme"
    }
  }
]
The exact field set depends on the output schema you defined when you built the collector. One row per successful input by default.

Polling pattern

Poll at fixed five-second intervals until you receive a non-empty JSON array. The starter templates implement this with exponential-backoff retry on transient errors and a five-minute timeout default. See the Node.js or Python starter for the full implementation.

Errors

StatusCauseFix
401 UnauthorizedToken missing, malformed or revokedRe-copy from Account Settings → API Tokens
404 Not FoundCollection ID does not exist, was deleted or has expired (16 days)Re-trigger the collector if the data is still needed
[] (empty array)Collection completed but produced no rowsCheck the input URLs and the collector’s output schema
5xxTransient Bright Data API errorRetry with exponential backoff, for example 1s, 2s, 4s

Authorizations

Authorization
string
header
required

Use your Bright Data API Key as a Bearer token in the Authorization header.

How to authenticate:

  1. Obtain your API Key from the Bright Data account settings at https://brightdata.com/cp/setting/users
  2. Include the API Key in the Authorization header of your requests
  3. Format: Authorization: Bearer YOUR_API_KEY

Example:

Authorization: Bearer b5648e1096c6442f60a6c4bbbe73f8d2234d3d8324554bd6a7ec8f3f251f07df

Learn how to get your Bright Data API key: https://docs.brightdata.com/api-reference/authentication

Query Parameters

id
string
required

The ID of the dataset to retrieve.

Response

Dataset (Ready)