Leonardo.Ai Logo

Leonardo.Ai

Research Engineer - Data

Job Posted 15 Days Ago Posted 15 Days Ago
Be an Early Applicant
Remote
2 Locations
Mid level
Remote
2 Locations
Mid level
As a Research Engineer - Data, you will manage data pipelines, work on multi-modal datasets, optimize data processing systems, and collaborate with researchers to develop innovative AI solutions.
The summary above was generated by AI

Leonardo.Ai, now part of the Canva family, is on a mission to redefine creativity through cutting-edge generative AI. Our platform empowers millions worldwide to effortlessly produce high-quality images, videos, and more. With nearly a quarter of a billion users, we’re building a world-class R&D team to push the boundaries of AI creativity. 
The Role:

As a Research Engineer – Data at Leonardo, you will architect and manage petascale data pipelines, combining text, images, 3D models, and other data modalities to drive world-class AI models. You’ll work hand-in-hand with our Researchers to create and curate large, multi-modal datasets, including synthetic data, that supercharge SOTA generative AI solutions. Your expertise in distributed systems, data processing, and experimentation will shape the backbone of our research work. 
Responsibilities:

  • Data Acquisition & Curation
    Lead the ingestion, unification, and organization of large, unstructured data sources (e.g., text, images, 3D geometry, code snippets) into scalable, high-quality datasets suitable for machine learning research and production.

  • High-Performance Data Pipelines
    Develop and optimize distributed systems for data processing, including filtering, indexing, and retrieval, leveraging frameworks like Ray, Metaflow, Spark, or Hadoop.

  • Synthetic Data Generation
    Build and orchestrate pipelines to generate synthetic data at scale, advancing research on cost-efficient inference and training strategies.

  • Experiments & Analysis
    Design and conduct experiments on dataset quality, scalability, and performance. 

  • Security & Compliance
    Collaborate with legal and safety teams to ensure all data usage respects privacy, security, and ethical standards.

  • Open-Source Contributions
    Contribute to internal and external libraries or frameworks, sharing insights and breakthroughs with the wider AI community through publications or technical blogs.

Skills we like you to have:

  • Multi-Modal Data Expertise
    Hands-on experience with images, videos, 3D geometry (mesh/solid modeling), and/or text data. Well-rounded expertise in Python and PyTorch. 

  • Synthetic Data & Inference
    Passion for synthetic data generation making use of inference of pretrained models, 3D rendering engines, and/or other softwares. 

  • Distributed Computing & MLOps
    Demonstrated proficiency in setting up large-scale, robust data pipelines, using frameworks like Spark, Ray, or Metaflow. Comfortable with model versioning, and experiment tracking.

  • Performance Optimization
    Good understanding of parallel and distributed computing. Experienced with setting up evaluation methods

  • Cloud & Storage Systems
    Experience with AWS, Azure, or other cloud platforms. Proficient in both relational (MySQL, PostgreSQL) and NoSQL (MongoDB, Cassandra) databases, plus vector data stores.

Our Culture

  • Inclusive Culture: We celebrate diversity and are committed to creating an inclusive environment where everyone feels valued and empowered. Your unique perspectives and experiences are essential to our success.

  • Flexible Work Environment: We understand the importance of work-life balance. Thrive personally and professionally with the option to work remotely or in our vibrant offices.

  • Empowering Growth: We invest in your development with continuous learning opportunities and clear pathways for career advancement tailored to your goals. 

  • Meaningful Impact: Be part of shaping the future of AI and contribute to innovative projects with global impact.

If you’re passionate about building scalable data ecosystems that fuel the next frontier of AI innovation—and you’re excited to collaborate with top-tier researchers and engineers—join us at Leonardo.Ai to make creativity boundless for everyone.

Top Skills

AWS
Azure
Cassandra
Hadoop
Metaflow
MongoDB
MySQL
Postgres
Python
PyTorch
Ray
Spark

Similar Jobs

3 Days Ago
Remote
32 Locations
Senior level
Senior level
Artificial Intelligence
As a Senior Research Engineer in Data, you'll manage the data lifecycle for AI research, optimize deep learning pipelines, and work extensively with large video and audio datasets.
Top Skills: KubeflowPythonSpark
15 Days Ago
Remote
2 Locations
Mid level
Mid level
Big Data • Cloud • Digital Media • Machine Learning • Mobile • Software • Industrial
As a Research Engineer, you'll lead projects on data acquisition and curation, develop scalable systems for multi-modal datasets, and work on data analysis and visualizations to support AI features.
Top Skills: AWSAzureCadCassandraHadoopHiveMachine LearningMetaflowMongoDBMySQLPostgresRay DataSpark
2 Hours Ago
Remote
4 Locations
Expert/Leader
Expert/Leader
Artificial Intelligence • Automotive • Computer Vision • Information Technology • Internet of Things • Logistics • Software
The Senior Principal Architect is responsible for designing robust architectures for strategic customer use cases within the automotive sector, engaging with stakeholders to create impactful solutions, mentoring engineering teams, and analyzing customer requirements to support project lifecycles.
Top Skills: Ad SystemsAdasAutomotive SoftwareAWSAzureCloud ServicesConnected CarsEnvironmental Model BuildingGCPMap DataNavigation Data Standard (Nds)PositioningSdv SolutionsSimultaneous Localization And Mapping (Slam)Vehicle E/E ArchitecturesVehicle Sensor Data

What you need to know about the Manchester Tech Scene

Home to a £5 billion digital ecosystem, including MediaCity, which consists of major players like the BBC, ITV and Ericsson, Manchester is one of the U.K.'s top digital tech hubs, at the forefront of advancements in film, television and emerging sectors like as e-sports, while also fostering a community of professionals dedicated to pushing creative and technological boundaries.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account