--- title: Sources description: Our platform aggregates job listings from over 336k different websites. Below you'll find a breakdown of our largest job data sources and their contributions. url: https://theirstack.com/en/docs/data/job/sources --- ## Job data sources We collect [job data](/en/docs/data/job) from over 336k different websites across the web. All these sources are crawled using publicly available data. We scrape some websites directly, while others are accessed indirectly through job board aggregations. When we break down jobs by source, a single job can be associated with multiple sources (for example, a company career page and one or more job boards). Because we count a job once per source, the sum of jobs across all sources can be higher than the total number of unique jobs in our dataset. Below, you'll find a breakdown of our largest job data sources and their contributions. If you need the full list, contact us. ## FAQS: Frequently asked questions #### How can I see if you have jobs from a specific website? You can see whether we have jobs from a specific website by going to the [app](https://app.theirstack.com) and searching for jobs while filtering by `url_domain`. For example, [here's a search showing jobs from LinkedIn](https://app.theirstack.com/search/jobs/new?query=N4IgjgrgpgTgniAXKAlgOwMYBsIBMoD6ALgPZECGWBMUAzhFkbUgGaW1QA0IJM+MBAEYJEAbVD5aGJERjRuLFFCy4kIXOSKEADiVpbVAXwC63Xfqi4CmggFtyAD2sBzQhrjNEARgCs3CDBUuCT26AS8SKIgWOgA1pboAHQYISDGhtzkEKQEHOQwGAAWMnJQhkA). #### How can I request a new job source? Companies use job boards to promote their jobs, and job boards constantly scrape each other to keep their catalog strong. That means the same role can appear under different sources, so not seeing a particular source doesn't necessarily mean we don't have the job. Before you request a new source, please: - **Make sure we don't already have jobs from this source** by following [the process mentioned above](#how-to-check-jobs-from-specific-website). - **Pick a few jobs and search for the company** in our [company lookup](/en/docs/app/company-lookup), review their latest jobs, and confirm the titles you're interested in aren't already there. - **Work out what type of source it is** — (1) a job board, (2) a generic job board, or (3) a company website — and estimate how many jobs it has in the last 30 days. - Contact our support team through our [app](https://app.theirstack.com). Find the "Contact support" button when you click on the `?` icon in the top left corner of the page, next to our logo. #### Example of good request email. Use this template and replace the placeholders with the actual information from your specific case. ``` I'd like to add [WebsiteName] (https://example.com) as a new source. It's a job board focused on _____. I've used the company lookup tool to find jobs from [WebsiteName] and these jobs from it are NOT present in TheirStack: - [link to job 1 on [WebsiteName]] - [link to job 2 on [WebsiteName]] - [link to job 3 on [WebsiteName]] - ... - [link to job N on [WebsiteName]] Out of the [Number of companies I checked], I found [Number of companies] that have jobs from [WebsiteName] and these jobs are not present in TheirStack. In total, I looked for [Number of of jobs]. Out of those, I found [Number of jobs] in TheirStack. I estimate this source has roughly [Number of jobs] new jobs per month. Thanks, ``` #### Why don't we recommend filtering by job scraping source? These days, roles usually originate in a company’s ATS and are then published across platforms like LinkedIn, Indeed, and Glassdoor. A single job can appear in multiple places almost instantly. We crawl many sources and store each job only once — the first time we find it. For example, if we discover a role on Greenhouse before it appears on LinkedIn, the job in TheirStack won’t list LinkedIn as its source, even if it’s also posted there. Because of this first-seen approach, filtering by a specific scraping source (e.g., `scraping_source = Indeed`) will only return jobs initially found on that source, which often reduces results and can be misleading. We recommend avoiding this `scraping_source` filter unless you're aware that you'd be missing jobs that we actually have if you use it. #### Why do I see a smaller number of job listings on TheirStack compared to platforms like Indeed or LinkedIn? When you search for a job on Indeed or LinkedIn, you may see more results than on TheirStack. This is because Indeed uses broader keyword matching, while TheirStack prioritizes precision. The difference lies in how job searches are conducted on TheirStack versus platforms like LinkedIn, Indeed, or Glassdoor. Let's focus on Indeed as an example, though the same principles apply to other providers. #### Search methodology: Keyword matching Job boards like Indeed use broader keyword matching: - **Synonyms and Related Jobs**: For example, when you search for "network engineer," Indeed may return jobs that loosely match those terms, even if they don't contain the exact keywords. - **Company Name Matching**: For example, when you search for "franchise", LinkedIn may return jobs where the company name contains the word 'franchise'. On TheirStack, we prioritize precision. Our job title filter is deterministic, meaning it only returns results with an exact match for the keywords you provide. This approach ensures greater accuracy in the search results. #### Search methodology: Date range Indeed and LinkedIn typically display jobs that have been posted or reposted within the last 30 days by default, whereas TheirStack does not automatically filter jobs based on their reposted date. #### How to Increase Results on TheirStack - **Add More Keywords**: Broaden your search terms to include additional relevant keywords (see Picture A). - **Use Regex Patterns**: Leverage regular expressions for more flexible and advanced search queries (see Picture B). #### Upcoming Features We're working on a new filter that incorporates a synonym dictionary, which will enhance your search capabilities. We're also considering a filter to search by reposted date — this is in our backlog and we'll prioritize it based on customer interest. #### Why do I see fewer jobs on TheirStack than on a company's career page? There are two common reasons: 1. Coverage and timing: If a company has just posted a role, there can be a short delay before we discover it. Also, while we scan many company sites, our coverage prioritizes major job boards like Indeed, LinkedIn, Workable, and Welcome to the Jungle. Jobs reposted to these portals are more likely to be picked up quickly. If a role only appears on the company website and isn’t syndicated to any job boards, we may not have it yet. 2. Deduplication: Companies often repost the same role multiple times over several weeks. We deduplicate aggressively to avoid listing the same job multiple times. If we find a job with the same title from the same company that was already posted within the last 30 days, we keep the earliest one and ignore later duplicates. This prevents clutter, avoids overcounting, and ensures customers aren't charged multiple [credits](/en/docs/pricing/credits) for the same underlying job.