What is TheirStack?
TheirStack is a job and technographic database that helps companies find clients. We scrape jobs and company information from the internet in almost real-time and let our users consume it in a structured, straight-forward way.
We scrape about 400k jobs every day, receive ~1M requests/day, have hundreds of paying customers and thousands of monthly active users.
Our data is leveraged by customers from different verticals, such as:
- B2B SaaS companies like Superflex.ai (a tool that generates code from Figma files), pull with our UI lists of companies utilizing a technology (Figma in their case) to drive their cold email outreach.
- Job boards like EnergyHire - the largest energy-sector recruiting agency - use our Jobs API to backfill their job board with all jobs publish on Linkedin, Indeed, Glassdoor, Workable…
- Salestech companies like Databar.ai use our Technographics API and Jobs API to provide jobs and technology usage data so that their users can power their outbound campaings with it
- Security companies use our full datasets of 70M jobs and the full tech stack of 5M companies to identify third party risks
Our company - philosophy
We haven’t raised any money and don’t have plans to do so.
Our goal is to keep building a sustainable, profitable company. Small in headcount and big in revenue per employee.
Success for us means living a good life solving intellectually stimulating problems and building a good product that ‘sells itself’.
We’re a company founded and driven by engineers, focused on creating exceptional solutions. Our focus is on building a product that speaks for itself, without relying heavily on traditional sales efforts. Every decision we make is guided by the question: 'Can this be discovered, used, and purchased autonomously by our users?’
We prefer simple rather than complex solutions.
We try to understand the technologies we use as much as we can. Finding the efficient solution is better long-term than just throwing money/machines at problems, and this is only possible if we know how things work underneath.
We’ll never invest 1$ to get $1.10. Finding and focusing on what has the highest ROI has been and will keep being essential to being a small team.
We’re fine with working remotely for months, but most of the year we work in the same office and for our stage this is very important, and we’d rather find someone that can be in the office with everyday. We have nice ocean views :)
How we got here - our story
Xoel started NoiceJobs in early 2020. It was a bunch of job boards for which he had to build a lot of job boards and a way to store those jobs.
TheirStack was born as a pivot from that in late 2021 to explore several hypotheses:
- jobs signal companies’ intents, problems and needs, and can be used as sales triggers
- jobs can also be used to infer the tech stack of companies
In 2023, Christian joined the team to amplify marketing and sales efforts, but thanks to his technical background he made lots of improvements in the backend, as well as all the work on the frontend both for our landing page and product.
And in 2024, we grew a lot, with double-digit growth in users and revenue every month.
What will you do?
We are looking for someone with experience in Data Engineering.
You’d help us solve some hard problems involving entity resolution and data deduplication that need to solve, and similar problems that will arise in the future.
You don’t need to check every box below or use all the technologies we use, but what matters more is that you are excited to learn, adapt, and grow alongside our evolving company needs.
Your skills
- Entity Resolution Fuzzy/probabilistic matching of companies via domains, LinkedIn IDs, logos, names, and other signals.
- Job Deduplication Time-aware and content-aware deduplication using job similarity, reposting patterns, and ML.
- ClickHouse Expertise Schema design, projections, partitioning, and materialized views for fast ingestion and querying at scale.
- Real-Time Data Processing Designing ingestion pipelines that classify, deduplicate, and assign company identity on the fly.
- Scalable Reprocessing Systems that reprocess 100M+ records within hours — enabling batch updates and re-training.
- Backend Engineering (Senior Level) Strong software engineering practices to build scalable, reliable APIs; experience with performance, monitoring, observability, and production readiness.
- AI for Web Scraping Automation (Future Work) Interest or experience in automating scraper generation using AI — e.g., analyzing browser network requests to reverse engineer internal APIs.
What you will work on
These are some actual problems we’re looking that the person we hire will help us solve.
- Build a real-time job ingestion pipeline that resolves company identity and deduplicates incoming data.
- Design and maintain a ClickHouse schema for fast querying of new and reposted jobs.
- Own backend components powering this pipeline, applying robust engineering practices (tests, observability, scalability).
- Enable fast, repeatable reprocessing of historical job data (100M+) for retroactive improvements.
- Collaborate on long-term R&D around AI-assisted scraper generation, reducing time to ingest new job sources.
About you
You are a curious person and can learn things on your own.
You like learning about the internals of the systems you work with.
You’re familiar with testing, value code quality and maintainability.
You are a team player, like teaching what you know and helping others grow as well.
You don’t have to be managed and can think of the right tradeoffs to be made to find the right technical solutions given the business context.
You don’t need to be fluent in English, but should be able to read and write in English, as most of our customers are international.
How we work
When will you work?
We work 37.5 hours per week — usually around 8 hours a day from Monday to Thursday, starting at 9:00 a.m. You’re free to start earlier if that fits your routine better. On Fridays, we typically wrap up a bit early — though you can shift that time to another day if needed.
You’ll get 25 vacation days each year, along with the flexibility to work remotely for a few weeks if you want.
We also maintain a more relaxed schedule during summer months (July to August) working 7 hours per day.
Where will you work?
Our team is based in A Coruña, Spain, where we share a co-working space and collaborate in person. This is an on-site role — we value the creativity and connection that come from working together — but we’re also flexible when life demands it.
We understand that everyone has personal responsibilities, whether that’s family, caregiving, or other commitments. This role is based in Coruña and you're expected to be in the office most days, but you can work remotely for a few weeks each year if you want — whether that’s from the Canary Islands, the mountains, or anywhere that inspires you.
Learning & Development
We believe the company grows best when you do. That’s why we offer a dedicated budget for training, courses, and conferences — anything that helps you expand your skills and bring new value to the team. If there’s something you want to learn, we’ll support you in making it happen.
The team
As the second employee at TheirStack, you'll work closely with the founders, helping to shape the company's future:
- Xoel Lopez: Previously at Zara.com, Narrativa, and Tinybird, Xoel specializes in data and web scraping. He kickstarted TheirStack by developing the MVP and building the core data infrastructure, the scrapers and the API, from the ground up.
- Christian Palou: Previously at Manomano.com and Zara.com, Christian focuses on UX, front-end and backend development at TheirStack. Before TheirStack, he launched TrainerStudio.io, a SaaS for personal trainers that was acquired last year.
- Pablo Durán: Previously at Fever, Sngular. Pablo specializes in full stack development (data, backend, frontend, devops, etc). He joined TheirStack in June 2024.
Compensation
40-80k€/year
Location
On-site position at our co-working space in A Coruña, Spain
Interview process
Interview process
- One 30-min chat with us to get to know you, tell you a bit more about us and the position.
- A 30-min chat to learn more about your soft skills and your technical experience.
- A take-home test to spend 3-4h on. You can use AI on it, we care more about how you think and what production-ready means for you.
- Another 30-min short call to discuss the test.
- An offer from us.
- And as many extra calls you and us need to get to know each other, the code… better.
One more thing
We’re flexible about many things (salary, profit share, location), and everything is pretty much negotiable. Nothing is written in stone.
Even if you don’t tick all boxes but still think this is an interesting position, are excited about the challenges we have and think you can help us make the pie larger, apply!
How to apply
Apply through this link