Market opportunity and competitive strengths
We estimate the market for high-quality, human-annotated data for Generative AI is currently worth around USD 2 billion, which will grow at more than 40% CAGR for next 5 years.
For successful model development, Generative AI needs huge amounts of high-quality data, which has to be accurate, diverse, legally compliant and produced at scale. This is a complex task and one of the bottlenecks of modern AI development.
Toloka’s customers include prominent names (big tech, Fortune-500 companies and A-class startups backed by top investors) across the entire spectrum of Generative AI companies, from the largest foundation model producers in global big tech to emerging AI-powered start-ups, and enterprises who are starting to integrate Generative AI solutions.
There are few companies with both the expertise and technological infrastructure to build robust and high-quality human data operations at scale. This is the market opportunity that Toloka is positioned to capture. To scale fast and efficiently, Toloka invests in technological solutions like marketplace, quality control systems and expert copilots.
Toloka also anticipates increasing demand for data evaluation services for GenAI development, as models are deployed in production in increasing numbers. This is a whole new market that provides additional opportunities for significant growth and expansion of the total addressable market.
Financial highlights
Toloka’s Generative AI offering is forecasted to grow multiple times this year, becoming Toloka's major source of revenue.
Following initial investments in the GenAI pivot, gross margins have shown steady month-over-month growth over the last quarter, reaching an expected range of 45-50% in the GenAI business stream. With continued focus on enhancing production efficiency, automation, and the use of synthetic data and auto-labeling, we see an opportunity to further improve our gross margins.
Our people
The Toloka team currently includes around 200 professionals with deep expertise in ML, engineering, and marketplaces. The core management team – including the CEO, CTO and CPO – is based in Amsterdam and includes alumni of Google, Microsoft, SAP, Yandex, McKinsey, Bain and Y Combinator-backed startups. Toloka’s founder and CEO is Olga Megorskaya, who has over a decade of experience in AI and data labeling, including extensive expertise managing distributed tech- and non-tech teams.
IP and technology
Toloka is transitioning from its legacy crowdsourcing platform, which is set to be shut down by the end of 2024, towards a new focus on Generative AI solutions.
The pivot from crowdsourcing to Generative AI data partner requires a radically different technology platform, which Toloka is currently building from scratch. The new platform entails creating data for Generative AI models instead of just labeling data and designed with enhanced safety, cybersecurity and privacy features.
While the new platform is expected to be fully up and running by the end of this year, Toloka is launching certain products and parts of the new platform as they are ready. In April 2024, Toloka also launched