Skip to content

Sample dataset of 1001 Tokopedia products, extracted via Bright Data API, featuring essential data points for competitive analysis, consumer sentiment, product insights, pricing strategy, and more.

Notifications You must be signed in to change notification settings

luminati-io/Tokopedia-dataset-samples

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

# Tokopedia-dataset-samples

A sample dataset of 1001 Tokopedia products

Tokopedia dataset header

A Tokopedia dataset sample of over 1000 records. Dataset was extracted using the Bright Data API.

Some of the data points that are included in the Tokopedia dataset:

  • product_id: Product id
  • title: Product's title
  • url: URL of the product listing
  • currency: Currency of the price
  • delivery: Product delivery details
  • final_price: Final price of the product
  • initial_price: Initial price of the product
  • seller_name: Product seller name
  • description: Description of the product
  • availability: Availability of the product
  • reviews_count: Number of reviews of the product
  • rating_count: Number of people who rated the product
  • rating: Product rating
  • discussion_count: Number of discussions of the product
  • categories: Product categories
  • images_count: Number of images of the product

And a lot more.

This is a sample subset which is derived from the "Tokopedia products" dataset which includes more than 1.2M records.

Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz.

Dataset delivery type options: Email, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP.

Update frequency: Once, Daily, Weekly, Monthly, Quarterly, or Custom basis.

Data enrichment available as an addition to the data points extracted: Based on request.

Get the full Tokopedia dataset.

What are the Tokopedia datasets use cases?

1. Product Insights & Pricing Strategy

Discover Tokopedia's top-selling products to enhance your inventory, pricing, and supply chain strategies. Leverage insights on popular searches and high-demand items to fine-tune your marketing efforts and maintain a competitive edge in the market.

2. Competitive Analysis

Outpace Tokopedia’s top sellers by examining their product offerings, customer reviews, and promotional strategies. Uncover new business opportunities and gain insights to strengthen your marketplace presence.

3. Consumer Sentiment & Brand Trends

Keep track of trending categories and brands on Tokopedia to stay informed about changes in consumer demand. Analyze product popularity across regions to align your brand strategy with evolving buyer preferences.

Free access to web scraping tools and datasets for academic researchers and NGOs

The Bright Initiative offers access to Bright Data's Web Scraper APIs and ready-to-use datasets to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application here.

About

Sample dataset of 1001 Tokopedia products, extracted via Bright Data API, featuring essential data points for competitive analysis, consumer sentiment, product insights, pricing strategy, and more.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published