Datasets
Learn about the different datasets available in TheirStack.
What are datasets?
Datasets give you direct access to our complete database of jobs, technology usage, and company information. Think of them as your raw data goldmine – perfect when you need maximum flexibility for advanced analytics, machine learning projects, or custom analysis.
We've got two powerful datasets ready for you:
- Jobs Dataset: Access our complete collection of 120M+ job postings from 195 countries, dating back to 2021. Each job record includes essential details like job title, description, salary, location, company name, company URL, company industry, company size... Check out the complete data dictionary for all available fields.
- Technographics Dataset: Dive into 41M+ technology usage signals across 7.2M companies using 21,000+ different technologies. This isn't just one file – you'll get three comprehensive datasets: the main technographics data (company_id, technology_slug, confidence_score, n_jobs...), detailed company profiles (including domain, country, revenue, and employee count), and a complete technology catalog with descriptions and categories. Check out the complete data dictionary for all available fields.
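Because the Technographics Dataset arrives as three related files, a common first step is to join them. The sketch below shows one way to do that in plain Python. It assumes the company profiles are keyed by `company_id` and the technology catalog by `technology_slug`, and that the catalog exposes a `category` column; those key and column names, and the sample rows, are illustrative assumptions, so check the data dictionary for the real field names.

```python
# Illustrative rows only; real files carry more columns (see the data dictionary).
technographics = [
    {"company_id": "c1", "technology_slug": "snowflake", "n_jobs": 12},
    {"company_id": "c2", "technology_slug": "kafka", "n_jobs": 3},
]
companies = [
    {"company_id": "c1", "domain": "example.com", "country": "US"},
    {"company_id": "c2", "domain": "example.org", "country": "ES"},
]
technologies = [
    {"technology_slug": "snowflake", "category": "Data Warehouse"},
    {"technology_slug": "kafka", "category": "Message Queue"},
]


def enrich(technographics, companies, technologies):
    """Attach company and technology details to each technographics row."""
    # Build lookup tables once so each row is enriched in constant time.
    companies_by_id = {c["company_id"]: c for c in companies}
    tech_by_slug = {t["technology_slug"]: t for t in technologies}

    enriched = []
    for row in technographics:
        company = companies_by_id.get(row["company_id"], {})
        tech = tech_by_slug.get(row["technology_slug"], {})
        enriched.append(
            {
                **row,
                "domain": company.get("domain"),      # None if no profile matches
                "country": company.get("country"),
                "category": tech.get("category"),
            }
        )
    return enriched


rows = enrich(technographics, companies, technologies)
```

For files at full dataset scale, the same join is usually better expressed in a dataframe library or a SQL engine; the dictionary lookups here only illustrate how the three files relate.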
Frequency of updates
TheirStack offers three flexible dataset access options to meet your data needs:
- Historical Access: Receive a one-time download link containing all available records at the time of purchase.
- Daily Updates: Get daily download links with new records added to the dataset, including delta files for incremental updates.
- Complete Access: Combine both historical and daily updates for comprehensive data coverage.
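With Complete Access, the historical download serves as a baseline and each daily delta file is applied on top of it. The following is a minimal sketch of that merge, assuming every record has a unique `id` column and that a delta file contains full rows for new or changed records. Both are assumptions made for illustration; the actual key column and delta semantics should be confirmed against the data dictionary.

```python
def apply_delta(base_rows, delta_rows, key="id"):
    """Return base_rows with delta_rows applied as inserts or overwrites.

    `key` is a hypothetical unique-identifier column name.
    """
    merged = {row[key]: row for row in base_rows}
    for row in delta_rows:
        # A delta row replaces the existing record with the same key,
        # or is added if the key has not been seen before.
        merged[row[key]] = row
    return list(merged.values())


historical = [
    {"id": 1, "job_title": "Data Engineer"},
    {"id": 2, "job_title": "Backend Developer"},
]
daily_delta = [
    {"id": 2, "job_title": "Senior Backend Developer"},  # changed record
    {"id": 3, "job_title": "ML Engineer"},               # new record
]

current = apply_delta(historical, daily_delta)
```

Applying the delta files in date order keeps the local copy in step with the dataset without re-downloading the full history.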
Delivery format
All datasets are delivered as a link to an S3 bucket, in either CSV or Parquet format.
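Since a delivered file may be either CSV or Parquet, a loader typically has to branch on the format. The sketch below infers the format from the file extension in the download link and parses CSV text with the standard library. It assumes the link's path ends in `.csv` or `.parquet` (optionally followed by a compression suffix such as `.gz`), which is an assumption about file naming rather than documented behavior; the example URL is hypothetical.

```python
import csv
import io
from pathlib import PurePosixPath
from urllib.parse import urlparse


def dataset_format(link):
    """Guess 'csv' or 'parquet' from the path of a download link."""
    # urlparse drops the query string, so signed-URL parameters are ignored.
    suffixes = [s.lower() for s in PurePosixPath(urlparse(link).path).suffixes]
    if ".parquet" in suffixes:
        return "parquet"
    if ".csv" in suffixes:
        return "csv"
    raise ValueError(f"Cannot infer dataset format from link: {link}")


def read_csv_rows(text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(text)))


# Hypothetical link, for illustration only.
fmt = dataset_format("https://example-bucket.s3.amazonaws.com/jobs/part-0.parquet?sig=abc")
sample = read_csv_rows("job_title,location\nData Engineer,Madrid\n")
```

Parquet files are not readable with the standard library; a package such as `pyarrow` or `pandas` is needed for those.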
Data structure
You can view the data dictionary for each dataset in the app.
How to access datasets
You can access and manage datasets through the TheirStack App. The dataset page provides comprehensive information and tools:
- Last update date
- Total record count
- Data dictionary
- Sample data download
- Dictionary download
- Full dataset download (requires access)
- Data refresh frequency
How to get the latest dataset link
To get the latest download link for a dataset, send a GET request to the dataset endpoint.
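A request of this kind could be sketched as follows with Python's standard library. The endpoint URL is deliberately left as a parameter because it is not given here, and the `Authorization: Bearer` header and JSON response are assumptions for illustration; consult the API reference for the actual path, authentication scheme, and response shape.

```python
import json
import urllib.request


def build_dataset_request(endpoint_url, api_key):
    """Build a GET request for the dataset endpoint without sending it."""
    return urllib.request.Request(
        endpoint_url,
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Accept": "application/json",
        },
        method="GET",
    )


def fetch_latest_dataset_info(endpoint_url, api_key, timeout=30):
    """Send the request and return the decoded JSON body (assumed to be JSON)."""
    request = build_dataset_request(endpoint_url, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Placeholder values; replace with the real endpoint and your API key.
request = build_dataset_request("https://api.example.com/datasets", "YOUR_API_KEY")
```

Calling `fetch_latest_dataset_info` with the real endpoint and key would perform the network request. Separating request construction from sending makes the headers easy to inspect before any call is made.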