
Software Engineer, Data Infrastructure & Acquisition
Speechify
Colombo
Posted: Jan 5, 2026
Job Type: Full-Time
Work Mode: On-site
Salary: Negotiable
Job Description
Mission: Speechify ensures that reading is never a barrier to learning.
Over 50 million people use Speechify’s text-to-speech products to turn PDFs, books, Google Docs, news articles, websites, and more into audio, allowing them to read faster, remember more, and learn better.
Speechify’s products include the iOS App, Android App, Mac App, Chrome Extension, and Web App. Google named Speechify Chrome Extension of the Year, and Apple awarded Speechify the 2025 Design Award for Inclusivity.
Today, nearly 200 people worldwide work at Speechify in a fully distributed environment. The team includes frontend and backend engineers, AI research scientists, and professionals from Amazon, Microsoft, Google, Stripe, Vercel, Bolt, and high-growth startups.
Role Overview: Software Engineer – AI/Data
We are hiring for the Data side of our AI team. This role is responsible for all aspects of data collection to support model training operations. You will work on building high-quality datasets at petabyte scale by integrating infrastructure, engineering, and research.
This is a key role for someone who thinks strategically, enjoys fast-paced environments, and is passionate about building impactful user experiences.
What You’ll Do
- Source new audio data and integrate it into the ingestion pipeline
- Operate and extend cloud infrastructure for the pipeline, currently on GCP and managed with Terraform
- Collaborate with AI scientists to optimize cost, throughput, and quality, delivering richer datasets at scale
- Work with the AI Team and Speechify Leadership to craft the dataset roadmap for next-generation consumer and enterprise products
Ideal Candidate Profile
- BS/MS/PhD in Computer Science or a related field
- 5+ years of industry experience in software development
- Proficiency with bash/Python scripting in Linux environments
- Experience with Docker and Infrastructure-as-Code (Terraform)
- Professional experience with at least one major Cloud Provider (GCP preferred)
- Experience with web crawlers or large-scale data processing workflows is a plus
- Ability to handle multiple tasks and adapt to changing priorities
- Strong written and verbal communication skills
Why Join Speechify
- Fast-growing, impactful environment where your contributions shape the company and products
- Work in an entrepreneurial team that values risk, intuition, and hustle
- Hands-off management approach fostering focus, creativity, and autonomy
- Opportunity to make a difference in a product that changes lives for people with learning differences such as dyslexia, ADD, low vision, concussions, autism, and more
- Competitive compensation and career development opportunities
- Work at the intersection of AI and audio, a rapidly evolving tech domain
- Fully distributed, remote-first work culture
Diversity & Inclusion
Speechify is committed to a diverse and inclusive workplace. We do not discriminate based on race, national origin, gender, gender identity, sexual orientation, veteran status, disability, age, or other legally protected status.
Think you’re a good fit? Include your portfolio and LinkedIn when applying.
Know someone perfect for the role? Refer them!