Introduction

What is TheirStackProduct updatesBlogAPI Reference

Data

Job Data
Data workflowFreshness & Scraping FrequencyJob sourcesJob status (active, closed)Use cases
Technographic Data

Product

App
Webhooks
Datasets

Pricing and Billing

Pricing
Affiliate Program

Integrations

ClayMake

Guides

Adding a technology or job filter to your company searchBackfilling a job boardHow to fetch jobs periodically using the Jobs APIHow to Choose the Best Way to Access TheirStack DataHow to find old jobsHow to get access to datasetsHow to identify companies with problems your software solvesOutreach companies actively hiringHow to send a slack message for every new job foundMonitoring open jobs from current and past customersIntegration guide for sales intelligence software

Other

Users and Teams
TheirStack TheirStack Logo

Docs

Job Data

Freshness & Scraping Frequency

Discover how we collect job data with multi-tiered scraping frequencies from 10 minutes to daily.

Real-Time Job Data Updates

At TheirStack, we maintain a sophisticated job scraping system that ensures you have access to the most current job postings. Our multi-tiered approach to data collection means you never miss out on relevant positions.

Job Scraping Frequency

We use different scraping frequencies based on the source and volume of job postings:

  • Every 10 Minutes: High-volume job boards and frequently updated company career pages
  • Every Hour: Medium-traffic job sources with regular updates
  • Daily: Smaller job boards and company websites with lower posting frequency

This tiered approach ensures that within 24 hours, we capture approximately 90% of all new tech job postings across our monitored sources.

Job Discovery Timeline

The following visualization shows when we discover jobs posted on a specific date (April 24th):

Tech job discovery timeline visualization

As illustrated, while most jobs are discovered within the first 24 hours, our system continues to find additional positions over subsequent weeks. This means:

  • Initial job counts for the current day may appear lower than final numbers
  • Job discovery continues over time, increasing the total count
  • You'll see the most comprehensive data after about 2-3 days

FAQS

Understanding delays in job discovery

When analyzing our data, you'll notice that a small percentage of jobs are discovered (discovered_at) after the day when they were initially posted (posted_at). This delay is common and is influenced by various factors beyond our control.

Discovery Timeline

To illustrate this, let's examine jobs posted on a specific day (December 14th) and observe when they were discovered by us.

Jobs posted on December 12th

Discovered AtPercentage of Jobs Discovered
Same Day (0)73%
Next Day (1)21%
2 Days Later3%
3 Days Later0.3%
4 Days Later0.2%

As shown, 73% of jobs are found on the same day they are posted. However, there is a significant number of jobs discovered days later. To understand this, it's important to know how we collect our data.

Factors Contributing to Delays

We scrape job boards, that at the same time scrape other job boards and career pages of companies. They also can get jobs posted directly by companies, or from companies’ ATS systems that sync with these job boards.

While we scrape these job boards constantly, there are many reasons a job posted by a company at a certain date is not available on those job boards instantly, and therefore it’s not possible for us to discover it. To name a few:

  • ATSs sync delay with job boards: The ATS that a company uses lets them sync jobs with a major job board, but the recruiter can choose which jobs to push to that job board because they charge for it. They may not initially sync it and do it after a few days to try to get more candidates. But the integration may keep the original date when the job was posted first, and show that in the final job board, instead of the date when the job was pushed. In this case, there will be a gap of a few days.

  • Job board scrapping delay: A job board scrapes company career pages periodically, running daily. If a company posts a job at 14h and the job board scraper visits it at 10h every day, that job won’t be available in the job board until the day after. For companies that post many positions, it makes sense for job boards to visit those career pages with a high frequency. But visiting every career page of every company in the world periodically has a cost, so for smaller companies that very ocasionally post jobs, doing it on a weekly basis could help those job boards save money. So imagine they visit one of these companies’ career site every Monday. If they post a job on Tuesday, they won’t visit it again until next Monday, so there will be a 6-day difference between posted_at and discovered_at

  • Job board publishing delay: If someone publishes a job directly on a job board we scrape, this job board may also let them set a custom posted_at that is days before the current date. But the job is not available at that job board until the very moment when that person publishes it there, and even if the reported posted_at is previous to that, that job wouldn’t have been discovered before because that person hadn’t published it in that job board yet.

How is this guide?

Last updated on

Data workflow

Learn how we transform raw job postings into high-quality, normalized data through our 5-step workflow covering acquisition, extraction, deduplication, enrichment, and quality assurance

Job sources

Our platform aggregates job listings from over 16,000 different websites. Below you'll find a breakdown of our largest job data sources and their contributions.

On this page

Real-Time Job Data Updates
Job Scraping Frequency
Job Discovery Timeline
FAQS
Understanding delays in job discovery
Discovery Timeline
Factors Contributing to Delays