---
title: Datasets
description: Access complete jobs, technographics, and company datasets, delivered as CSV or Parquet, with historical coverage and optional daily updates via S3.
url: https://theirstack.com/en/docs/datasets
---

## What are datasets?

Datasets give you direct access to our complete database of jobs, technology usage, and company information. Think of them as your raw data goldmine: perfect when you need maximum flexibility for advanced analytics, machine learning projects, or custom analysis.

We've got two powerful datasets ready for you:

- **[Jobs Dataset](/en/jobs-dataset)**: Access our complete collection of 205M job postings from 195 countries, dating back to 2021. Each job record includes essential details like job title, description, salary, location, company name, company URL, company industry, and company size. Check out the complete [data dictionary](https://app.theirstack.com/dataset) for all available fields.
- **[Technographics Dataset](/en/technographics-api)**: Dive into 49M technology usage signals across [12M companies](/en/docs/data/company/statistics) using 32k different technologies. This dataset ships as three files rather than one: the main technographics data (`company_id`, `technology_slug`, `confidence_score`, `n_jobs`, and more), detailed company profiles (including domain, country, revenue, and employee count), and a complete technology catalog with descriptions and categories. Check out the complete [data dictionary](https://app.theirstack.com/dataset) for all available fields.

## Frequency of updates

TheirStack offers three flexible dataset access options to meet your data needs:

- **Historical Access**: Receive a one-time download link containing all available records at the time of purchase.
- **Daily Updates**: Get daily download links with new records added to the dataset, including delta files for incremental updates.
- **Complete Access**: Combine both historical and daily updates for comprehensive data coverage.

## Delivery format

All datasets are delivered as a link to an S3 bucket. Files are available in CSV or Parquet format.

## Data structure

- [Jobs dictionary](/en/docs/datasets/options/job)
- [Companies dictionary](/en/docs/datasets/options/company)
- [Technographics dictionary](/en/docs/datasets/options/technographic)
- [Technologies dictionary](/en/docs/datasets/options/technographic)

## How to get the latest dataset link

To get the latest download link, send a `GET` request to the [dataset endpoint](/en/docs/api-reference/datasets/get_datasets_v1).

## Frequently Asked Questions

#### How often are files refreshed?

The dataset is refreshed daily with new records. A delta file is available for each day.

#### How can I validate the quality of the data?

The dataset has the same quality as the data in the [app](https://app.theirstack.com) or the API, because all delivery options draw from the same database. You can use the app or the API to validate the data.

#### Which is best for me: the API or the dataset?

Datasets are the right choice if you need more than 10M records per month. If you need fewer, the API is the better option.
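
## Example: requesting the latest dataset link

The `GET` request to the dataset endpoint described above could look roughly like the sketch below. It uses only the Python standard library. Several details are assumptions and not confirmed by this page: the URL `https://api.theirstack.com/v1/datasets` (inferred from the reference page name `get_datasets_v1`), bearer-token authentication, and the `THEIRSTACK_API_KEY` environment variable name, which is hypothetical. The response is printed as returned, since its fields are not documented here. Check the API reference for the actual path, auth scheme, and response shape.

```python
import json
import os
import urllib.request

# Assumption: base URL and path are inferred from the reference page name
# (get_datasets_v1). Confirm both in the API reference before relying on them.
DATASETS_URL = "https://api.theirstack.com/v1/datasets"


def build_datasets_request(api_key: str, url: str = DATASETS_URL) -> urllib.request.Request:
    """Build the GET request for the dataset endpoint without sending it."""
    return urllib.request.Request(
        url,
        headers={
            # Assumption: the API accepts a bearer token in this header.
            "Authorization": f"Bearer {api_key}",
            "Accept": "application/json",
        },
        method="GET",
    )


def fetch_datasets(api_key: str, timeout: float = 30.0):
    """Send the request and return the decoded JSON body unchanged."""
    request = build_datasets_request(api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # THEIRSTACK_API_KEY is a hypothetical variable name used for illustration.
    api_key = os.environ.get("THEIRSTACK_API_KEY")
    if api_key:
        print(json.dumps(fetch_datasets(api_key), indent=2))
    else:
        print("Set THEIRSTACK_API_KEY to run the request.")
```

Building the request separately from sending it makes it easy to inspect the URL and headers before any network call is made. With daily updates, a script like this would typically run once per day to pick up the newest link.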