
Building Datasets for Sovereign AI
1 week ago
Job Title: Senior Data Engineer
We're building sovereign AI models in Australia from the ground up. We train foundation models, design new reasoning architectures, and deploy them on cutting-edge GPU clusters. Our models run on datasets we create ourselves, curated, cleaned, and engineered for high performance at scale.
You'll lead the design and construction of these datasets as a Senior Data Engineer. You'll work hands-on to source, clean, transform, and structure massive amounts of raw data into training-ready form. You'll design the architecture that powers data ingestion, validation, and storage for large-scale AI training.
This is a deep technical role that requires you to think like both a data scientist and a systems engineer. You'll design for correctness, scalability, and future proofing. If you want to build the datasets that power sovereign AI from first principles, this is your opportunity.
- Design and build large-scale data ingestion and curation pipelines
- Source, filter, and process diverse data types
- Implement robust quality control and validation systems
- Architect storage and retrieval systems
- Build tooling to track dataset lineage and reproducibility
- Collaborate with AI Researchers and Engineers
- Continuously improve ingestion speed and data freshness
About Us: We're a small team building advanced AI systems in Australia. We create new foundation models from scratch and build the datasets they run on from the ground up. We operate our own GPU clusters and integrate research and engineering closely to push the frontier of what's possible.
What You'll Get: You'll be surrounded by people who care deeply about data quality and architecture, build systems that scale reliably and repeatably, take pride in learning and experimenting, and want to help Australia build independent, world-class AI systems.
-
Melbourne, Victoria, Australia beBeeDataEngineer Full time $120,000 - $180,000Senior Data Engineer Job DescriptionMaincode is building sovereign AI models in Australia. We are training foundation models from scratch, designing new reasoning architectures, and deploying them on state-of-the-art GPU clusters. Our models are built on datasets we create ourselves, curated, cleaned, and engineered for performance at scale. This involves...
-
AI Dataset Constructor
6 days ago
Melbourne, Victoria, Australia beBeeDataEngineering Full time $150,000 - $180,000Job DescriptionThe role of a Senior Data Engineer involves leading the design and construction of large-scale datasets for AI training. This encompasses sourcing, cleaning, transforming, and structuring massive amounts of raw data into training-ready form.Key ResponsibilitiesDeveloping large-scale data ingestion and curation pipelines for AI training...
-
Building Sovereign AI Models
2 weeks ago
Melbourne, Victoria, Australia beBeeSoftware Full time $150,000 - $180,000AI Software Development PositionThis is a senior role in software engineering where you will be working on building sovereign AI models.About the RoleYou'll collaborate with our research team to implement cutting-edge algorithms and ideas.Design high-performance systems that run these models in real-world environments.Ongoing learning and exploration of new...
-
Sovereign AI Model Architect
5 days ago
Melbourne, Victoria, Australia beBeeAI Full time $180,000 - $200,000About the RoleMaincode is building cutting-edge sovereign AI models from scratch. Our team is responsible for designing and implementing foundation models that are tailored to Australian needs.This is a unique opportunity to be part of a pioneering project in the field of AI. As a key member of our team, you will work closely with researchers and engineers...
-
Senior AI Data Specialist
2 weeks ago
Melbourne, Victoria, Australia beBeeData Full time $150,000 - $180,000Job OverviewWe are seeking an experienced professional to join our team as a Senior Data Engineer. In this role, you will be responsible for designing and building large-scale data ingestion and curation pipelines for AI training datasets.The ideal candidate will have a strong background in data engineering frameworks such as Apache Spark, Ray, or Dask, and...
-
Senior Data Engineer
2 weeks ago
Melbourne, Victoria, Australia MainCode Full time $150,000 - $180,000 per yearOverviewMaincode is building sovereign AI models in Australia. We are training foundation models from scratch, designing new reasoning architectures, and deploying them on state-of-the-art GPU clusters. Our models are built on datasets we create ourselves, curated, cleaned, and engineered for performance at scale. This is not buying off-the-shelf corpora or...
-
Senior Data Engineer
2 weeks ago
Melbourne, Victoria, Australia Maincode Full time $150,000 - $180,000 per yearOverviewMaincode is building sovereign AI models in Australia. We are training foundation models from scratch, designing new reasoning architectures, and deploying them on state-of-the-art GPU clusters. Our models are built on datasets we create ourselves, curated, cleaned, and engineered for performance at scale. This is not buying off-the-shelf corpora or...
-
Senior Data Engineer
1 week ago
Melbourne, Victoria, Australia Maincode Full timeOverviewMaincode is building sovereign AI models in Australia. We are training foundation models from scratch, designing new reasoning architectures, and deploying them on state-of-the-art GPU clusters. Our models are built on datasets we create ourselves, curated, cleaned, and engineered for performance at scale. This is not buying off-the-shelf corpora or...
-
Senior Data Engineer
2 weeks ago
Melbourne, Victoria, Australia Maincode Full timeOverviewMaincode is building sovereign AI models in Australia. We are training foundation models from scratch, designing new reasoning architectures, and deploying them on state-of-the-art GPU clusters. Our models are built on datasets we create ourselves, curated, cleaned, and engineered for performance at scale. This is not buying off-the-shelf corpora or...
-
Senior Data Engineer
2 weeks ago
Melbourne, Victoria, Australia Maincode Full timeOverviewMaincode is building sovereign AI models in Australia. We are training foundation models from scratch, designing new reasoning architectures, and deploying them on state-of-the-art GPU clusters. Our models are built on datasets we create ourselves, curated, cleaned, and engineered for performance at scale. This is not buying off-the-shelf corpora or...