Datasets

Learn about the different datasets available in TheirStack.


What are datasets?

Datasets give you direct access to our complete database of jobs, technology usage, and company information. Think of them as your raw data goldmine – perfect when you need maximum flexibility for advanced analytics, machine learning projects, or custom analysis.

We've got two powerful datasets ready for you:

  • Jobs Dataset: Access our complete collection of 120M+ job postings from 195 countries, dating back to 2021. Each job record includes essential details like job title, description, salary, location, company name, company URL, company industry, company size, and more. Check out the complete data dictionary for all available fields.

  • Technographics Dataset: Dive into 41M+ technology usage signals across 7.2M companies using 21,000+ different technologies. This isn't just one file – you'll get three comprehensive datasets: the main technographics data (company_id, technology_slug, confidence_score, n_jobs...), detailed company profiles (including domain, country, revenue, and employee count), and a complete technology catalog with descriptions and categories. Check out the complete data dictionary for all available fields.
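Since the Technographics Dataset arrives as three related files, a common first step is joining them. The sketch below is a minimal illustration using tiny in-memory samples rather than real data. The columns `company_id`, `technology_slug`, `confidence_score`, `n_jobs`, `domain`, and `country` come from the description above; the join keys in the company and technology files, the `category` column name, and every sample value are assumptions, so check the data dictionary for the actual schema.

```python
import csv
import io

# Invented sample rows standing in for the three delivered files.
# Join keys and the "category" column are assumed names; verify them
# against the data dictionary before relying on this.
TECHNOGRAPHICS_CSV = """company_id,technology_slug,confidence_score,n_jobs
c1,snowflake,high,12
c2,snowflake,low,1
c1,kafka,medium,4
"""
COMPANIES_CSV = """company_id,domain,country
c1,example.com,US
c2,example.org,ES
"""
TECHNOLOGIES_CSV = """technology_slug,category
snowflake,Data Warehouses
kafka,Message Queues
"""


def read_rows(text):
    """Parse CSV text into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def index_by(rows, key):
    """Build a lookup table from a key column to its row."""
    return {row[key]: row for row in rows}


def enrich(signals, companies, technologies):
    """Attach company and technology attributes to each usage signal."""
    companies_by_id = index_by(companies, "company_id")
    tech_by_slug = index_by(technologies, "technology_slug")
    enriched = []
    for signal in signals:
        company = companies_by_id.get(signal["company_id"], {})
        tech = tech_by_slug.get(signal["technology_slug"], {})
        enriched.append({
            **signal,
            "domain": company.get("domain"),
            "country": company.get("country"),
            "category": tech.get("category"),
        })
    return enriched


rows = enrich(
    read_rows(TECHNOGRAPHICS_CSV),
    read_rows(COMPANIES_CSV),
    read_rows(TECHNOLOGIES_CSV),
)
```

Signals with no matching company or technology keep `None` in the joined fields instead of being dropped, which makes gaps easy to spot.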

Frequency of updates

TheirStack offers three flexible dataset access options to meet your data needs:

  • Historical Access: Receive a one-time download link containing all available records at the time of purchase.
  • Daily Updates: Get daily download links with new records added to the dataset, including delta files for incremental updates.
  • Complete Access: Combine both historical and daily updates for comprehensive data coverage.
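With Complete Access, you would typically load the historical download once and then fold each daily delta into it. The sketch below shows one way to do that. It assumes every record carries a unique key (called `id` here, a hypothetical name) and that a delta row with an existing key should replace the older row; the actual delta semantics are not described on this page, so treat this as an illustration only.

```python
def apply_delta(snapshot, delta, key="id"):
    """Merge a delta file into a snapshot, keyed on a unique field.

    Assumption: records with a new key are appended, and records whose
    key already exists overwrite the older version.
    """
    merged = {record[key]: record for record in snapshot}
    for record in delta:
        merged[record[key]] = record
    # Dicts preserve insertion order, so original ordering is kept and
    # new records land at the end.
    return list(merged.values())


historical = [
    {"id": 1, "job_title": "Data Engineer"},
    {"id": 2, "job_title": "Backend Developer"},
]
daily_delta = [
    {"id": 2, "job_title": "Senior Backend Developer"},
    {"id": 3, "job_title": "ML Engineer"},
]
current = apply_delta(historical, daily_delta)
```

For datasets of this size, the same upsert logic usually belongs in a database or data warehouse rather than in memory.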

Delivery format

All datasets are delivered as a link to an S3 bucket. Each dataset is available in CSV or Parquet format.
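Once a file has been downloaded, either format can be read record by record. The following sketch picks a reader from the file extension. CSV is handled with the Python standard library; the Parquet branch assumes the third-party `pyarrow` package is installed. The function name `iter_records` and the assumption that files end in `.csv` or `.parquet` are illustrative, not something this page specifies.

```python
import csv
from pathlib import Path


def iter_records(path, batch_size=10_000):
    """Yield records as dicts from a downloaded CSV or Parquet file."""
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix == ".csv":
        with path.open(newline="", encoding="utf-8") as handle:
            yield from csv.DictReader(handle)
    elif suffix == ".parquet":
        # Imported lazily so CSV users do not need pyarrow installed.
        import pyarrow.parquet as pq

        parquet_file = pq.ParquetFile(str(path))
        for batch in parquet_file.iter_batches(batch_size=batch_size):
            yield from batch.to_pylist()
    else:
        raise ValueError(f"Unsupported dataset format: {suffix!r}")
```

Because the function is a generator, it streams rows instead of loading the whole file, which matters for files with millions of records. Note that CSV values arrive as strings, while Parquet preserves column types.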

Data structure

You can access the data dictionary for each dataset in the app.

How to access datasets

You can access and manage datasets through the TheirStack App. The dataset page provides comprehensive information and tools:

  • Last update date
  • Total record count
  • Data dictionary
  • Sample data download
  • Dictionary download
  • Full dataset download (requires access)
  • Data refresh frequency


To get the latest dataset download link, send a GET request to the dataset endpoint.
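As a rough sketch of that request using only the Python standard library: the endpoint URL is passed in by the caller because this page does not state it, and the Bearer token header and JSON response are assumptions. Check the API reference for the real path, authentication scheme, and response shape.

```python
import json
import urllib.request


def build_dataset_request(endpoint_url, api_key):
    """Prepare a GET request for the dataset endpoint.

    Assumptions: the API accepts a Bearer token in the Authorization
    header and responds with JSON.
    """
    return urllib.request.Request(
        endpoint_url,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Accept": "application/json",
        },
        method="GET",
    )


def fetch_dataset_info(endpoint_url, api_key, timeout=30):
    """Send the request and return the decoded JSON body."""
    request = build_dataset_request(endpoint_url, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_dataset_info("<dataset endpoint URL>", "<your API key>")` would return the parsed response, from which the download link can be read once you know the field names.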

