
Senior Data Engineer
2 weeks ago
Maincode is building sovereign AI models in Australia. We are training foundation models from scratch, designing new reasoning architectures, and deploying them on state-of-the-art GPU clusters. Our models are built on datasets we create ourselves, curated, cleaned, and engineered for performance at scale. This is not buying off-the-shelf corpora or scraping without thought. This is building world-class datasets from the ground up.
As a Senior Data Engineer, you will lead the design and construction of these datasets. You will work hands-on to source, clean, transform, and structure massive amounts of raw data into training-ready form. You will design the architecture that powers data ingestion, validation, and storage for multi-terabyte to petabyte-scale AI training. You will collaborate with AI Researchers and Engineers to ensure every byte is high quality, relevant, and optimised for training cutting-edge large language models and other architectures.
This is a deep technical role. You will be writing code, building pipelines, defining schemas, and debugging unusual data edge cases at scale. You will think like both a data scientist and a systems engineer, designing for correctness, scalability, and future proofing. If you want to build the datasets that power sovereign AI from first principles, this is your team.
What you'll do
- Design and build large-scale data ingestion and curation pipelines for AI training datasets
- Source, filter, and process diverse data types including text, structured data, code, and multimodal, from raw form to model-ready format
- Implement robust quality control and validation systems to ensure dataset integrity, relevance, and ethical compliance
- Architect storage and retrieval systems optimised for distributed training at scale
- Build tooling to track dataset lineage, reproducibility, and metadata at all stages of the pipeline
- Work closely with AI Researchers to align datasets with evolving model architectures and training objectives
- Collaborate with DevOps and ML engineers to integrate data systems into large-scale training workflows
- Continuously improve ingestion speed, preprocessing efficiency, and data freshness for iterative training cycles
Who you are
- Passionate about building world-class datasets for AI training from raw source to training-ready
- Experienced in Python and data engineering frameworks such as Apache Spark, Ray, or Dask
- Skilled in working with distributed data storage and processing systems such as S3, HDFS, or cloud object storage
- Strong understanding of data quality, validation, and reproducibility in large-scale ML workflows
- Familiar with ML frameworks like PyTorch or JAX, and how data pipelines interact with them
- Comfortable working with multi-terabyte or larger datasets
- Hands-on and pragmatic, you like solving real data problems with code and automation
- Motivated to help build sovereign AI capability in Australia
Why Maincode
We are a small team building some of the most advanced AI systems in Australia. We create new foundation models from scratch, not just fine-tune existing ones, and we build the datasets they run on from the ground up.
We operate our own GPU clusters, run large-scale training, and integrate research and engineering closely to push the frontier of what is possible.
You will be surrounded by people who:
- Care deeply about data quality and architecture, not just volume
- Build systems that scale reliably and repeatably
- Take pride in learning, experimenting, and shipping
- Want to help Australia build independent, world-class AI systems
#J-18808-Ljbffr
-
Senior Data Engineers
2 weeks ago
Melbourne, Victoria, Australia Otic Group Full timeJoin to apply for the Senior Data Engineers role at Otic Group1 day ago Be among the first 25 applicants Join to apply for the Senior Data Engineers role at Otic Group"Otic" means smart people doing smart work, together.We are a wholly owned Australian company committed to helping our clients design and build intelligent software solutions that unlock value...
-
Senior Mechanical Engineer, APAC
3 weeks ago
Melbourne, Victoria, Australia Vantage Data Centers Full timeAbout Vantage Data CentersAbout Vantage Data CentersVantage Data Centers powers, cools, protects and connects the technology of the world's well-known hyperscalers, cloud providers and large enterprises. Developing and operating across North America, EMEA and Asia Pacific, Vantage has evolved data center design in innovative ways to deliver dramatic gains in...
-
Senior Data Engineer
2 weeks ago
Melbourne, Victoria, Australia Emanate Technology Full timeGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Emanate Technology Principal Consultant - Data & AI | Tech Recruitment Senior Data Engineer Location: Melbourne Work Type: Full-Time, Hybrid Department: Technology The Role We're seeking a Senior Data Engineerto join a leading Melbourne CBD technology...
-
Senior Data Engineers
2 weeks ago
Melbourne, Victoria, Australia Otic Group Full timeJoin to apply for the Senior Data Engineers role at Otic Group1 day ago Be among the first 25 applicantsJoin to apply for the Senior Data Engineers role at Otic Group"Otic" means smart people doing smart work, together.We are a wholly owned Australian company committed to helping our clients design and build intelligent software solutions that unlock value...
-
Senior Data Engineer
2 weeks ago
Melbourne, Victoria, Australia Emanate Technology Full timeGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Emanate TechnologyPrincipal Consultant - Data & AI | Tech RecruitmentSenior Data EngineerLocation:MelbourneWork Type:Full-Time, HybridDepartment:TechnologyThe RoleWe're seeking a Senior Data Engineerto join a leading Melbourne CBD technology business. You'll help...
-
Senior Data Engineer
2 weeks ago
Melbourne, Victoria, Australia Emanate Technology Full timeGet AI-powered advice on this job and more exclusive features.Direct message the job poster from Emanate TechnologyPrincipal Consultant - Data & AI | Tech RecruitmentSenior Data EngineerLocation:MelbourneWork Type:Full-Time, HybridDepartment:TechnologyThe RoleWe're seeking a Senior Data Engineerto join a leading Melbourne CBD technology business. You'll help...
-
Senior Data Engineer
4 weeks ago
Melbourne, Victoria, Australia Commonwealth Bank Full timeJoin to apply for the Senior Data Engineer (AWS Cloud) role at Commonwealth Bank1 week ago Be among the first 25 applicantsJoin to apply for the Senior Data Engineer (AWS Cloud) role at Commonwealth BankGet AI-powered advice on this job and more exclusive features.You are determined to stay ahead of the latest Cloud, Big Data and Data warehouse...
-
Senior Data Engineer
4 weeks ago
Melbourne, Victoria, Australia Commonwealth Bank Full timeJoin to apply for the Senior Data Engineer (AWS Cloud) role at Commonwealth Bank1 week ago Be among the first 25 applicantsJoin to apply for the Senior Data Engineer (AWS Cloud) role at Commonwealth BankGet AI-powered advice on this job and more exclusive features.You are determined to stay ahead of the latest Cloud, Big Data and Data warehouse...
-
Senior Data Engineer
3 weeks ago
Melbourne, Victoria, Australia Talent Full timeJoin to apply for the Senior Data Engineer role at Talent1 day ago Be among the first 25 applicantsJoin to apply for the Senior Data Engineer role at TalentGet AI-powered advice on this job and more exclusive features.We are seeking an experienced and strategic Senior Data Engineer to join this utilities organisation.As a Senior Data Engineer, you'll be...
-
Senior Data Engineer
2 weeks ago
Melbourne, Victoria, Australia iterate Full timeGet AI-powered advice on this job and more exclusive features.We are partnered with a homegrown Australian business who are looking for a Senior Azure Data Engineer to take ownership of the modernisation their data infrastructure. This is a unique opportunity to lead the development of a centralised data platform, work with cutting-edge Azure technologies...