
Senior Data Engineer
2 days ago
Maincode is building sovereign AI models in Australia. We are training foundation models from scratch, designing new reasoning architectures, and deploying them on state-of-the-art GPU clusters. Our models are built on datasets we create ourselves, curated, cleaned, and engineered for performance at scale. This is not buying off-the-shelf corpora or scraping without thought. This is building world-class datasets from the ground up.
As a Senior Data Engineer, you will lead the design and construction of these datasets. You will work hands-on to source, clean, transform, and structure massive amounts of raw data into training-ready form. You will design the architecture that powers data ingestion, validation, and storage for multi-terabyte to petabyte-scale AI training. You will collaborate with AI Researchers and Engineers to ensure every byte is high quality, relevant, and optimised for training cutting-edge large language models and other architectures.
This is a deep technical role. You will be writing code, building pipelines, defining schemas, and debugging unusual data edge cases at scale. You will think like both a data scientist and a systems engineer, designing for correctness, scalability, and future proofing. If you want to build the datasets that power sovereign AI from first principles, this is your team.
What you'll do
Design and build large-scale data ingestion and curation pipelines for AI training datasets
Source, filter, and process diverse data types including text, structured data, code, and multimodal, from raw form to model-ready format
Implement robust quality control and validation systems to ensure dataset integrity, relevance, and ethical compliance
Architect storage and retrieval systems optimised for distributed training at scale
Build tooling to track dataset lineage, reproducibility, and metadata at all stages of the pipeline
Work closely with AI Researchers to align datasets with evolving model architectures and training objectives
Collaborate with DevOps and ML engineers to integrate data systems into large-scale training workflows
Continuously improve ingestion speed, preprocessing efficiency, and data freshness for iterative training cycles
Who you are
Passionate about building world-class datasets for AI training from raw source to training-ready
Experienced in Python and data engineering frameworks such as Apache Spark, Ray, or Dask
Skilled in working with distributed data storage and processing systems such as S3, HDFS, or cloud object storage
Strong understanding of data quality, validation, and reproducibility in large-scale ML workflows
Familiar with ML frameworks like PyTorch or JAX, and how data pipelines interact with them
Comfortable working with multi-terabyte or larger datasets
Hands-on and pragmatic, you like solving real data problems with code and automation
Motivated to help build sovereign AI capability in Australia
Why Maincode
We are a small team building some of the most advanced AI systems in Australia. We create new foundation models from scratch, not just fine-tune existing ones, and we build the datasets they run on from the ground up.
We operate our own GPU clusters, run large-scale training, and integrate research and engineering closely to push the frontier of what is possible.
You will be surrounded by people who:
Care deeply about data quality and architecture, not just volume
Build systems that scale reliably and repeatably
Take pride in learning, experimenting, and shipping
Want to help Australia build independent, world-class AI systems
-
Senior Data Engineer
7 days ago
Melbourne, Victoria, Australia DNA Technology Services Full time $120,000 - $180,000 per yearWe're Hiring: Senior Data Engineer – Melbourne Location: Melbourne Job Type: Fixed Term – 1 year (potential to extend up to 2 more years)About the RoleWe are seeking aSenior Data Engineerto join our client's Data & Insights team. This role will be a key contributor to themigration of legacy reporting platformsand themodernization of data infrastructure....
-
Senior Data Engineer
5 days ago
Melbourne, Victoria, Australia V2 Digital Full time $120,000 - $180,000 per yearABOUT V2 AIV2 AI is a leading Data & AI consultancy backed by $30m in VC funding, allowing us to meet our customers' needs. We harness the power of Data & AI to accelerate business outcomes for some of the world's largest brands. We bring decades of experience and a unique delivery model to partner with our customers on the most complex problems for immense,...
-
Senior Data Engineer
1 week ago
Melbourne, Victoria, Australia INNOVATE IT AUSTRALIA Full time $150,000 - $200,000 per yearSenior Data Engineer – MainframeJob DescriptionMust Have SkillsPL SQLMainframeUnixDetailed Job DescriptionMandatory Skills:· Strong data engineering skills using SQL, Unix, OracleDetailed Job DescriptionMandatory Skills: · Strong data engineering skills using SQL, Unix, Oracle· Sound foundation in SQL and Shell scripting· Should have SQL Query...
-
Senior Data Scientist
2 days ago
Melbourne, Victoria, Australia Tech & Data People Full time $120,000 - $180,000 per yearIf you're the kind of data scientist who loves diving deep into probabilities, simulation models, and complex statistical challenges — this is a great role for you.You'll join a growing team that's building out the industry leading data science projects. This isn't about stakeholder decks or endless meetings; it's a hands-on role for someone who loves...
-
Senior Data Engineer
2 weeks ago
Melbourne, Victoria, Australia INNOVATE IT AUSTRALIA Full time $104,000 - $130,878 per yearTitle : Senior Data Engineer : : MainframeLocation :Melbourne, VICMust Have Skills:• PL SQL• Mainframe• UnixDetailed Job DescriptionMandatory Skills:• Strong data engineering skills using SQL, Unix, Oracle• Sound foundation in SQL and Shell scripting• Should have SQL Query optimization skills• Extensive experience working in an agile...
-
Principal Data Engineer
2 weeks ago
Melbourne, Victoria, Australia Tech & Data People Full time $250,000 per yearPrincipal Data Engineer – Melbourne – Hybrid - $250,000 - bleeding-edge Cloud & Data tech - enable next-gen analytics & AI Tech and Data People are working with a leading organisation that is driving large-scale digital and data transformation. We're looking for a Principal Data Engineer to design and deliver modern, secure, and scalable data...
-
Senior Data Engineer
2 days ago
Melbourne, Victoria, Australia DBG Health Full time $120,000 - $180,000 per yearAbout DBG HealthDBG Health is a leading Melbourne-based health, wellness, and beauty company with over 1,500 professionals dedicated to making wellbeing accessible to all. Our diverse portfolio, spanning Arrotex Pharmaceuticals, VidaCorp Consumer Brands, Independent Pharmacies of Australia Group, Axe Health Services and MyDNA reflects our commitment to...
-
Senior Data Engineer
1 day ago
Melbourne, Victoria, Australia Zitcha Full time $120,000 - $180,000 per yearAt Zitcha, we're at the forefront of revolutionizing the retail media landscape, and we invite you to join our visionary team making Retail Media better for everyone.As the world's firstAdaptive, Unified Retail Media Platform, Zitcha empowers retailers to unlock their full potential by seamlessly integrating planning, delivery, and insights across all...
-
Senior Data Engineer
1 day ago
Melbourne, Victoria, Australia iterate Full time $120,000 - $180,000 per yearWe are partnered with a successful, remote-first SaaS company who are looking for a Senior Data Engineer to fuel their next phase of growth as they invest heavily in their data function.You will be building a modern data environment from the ground up and playing a key role in defining what the analytics landscape looks like moving forward.This is a great...
-
Senior Data Engineer
1 week ago
Melbourne, Victoria, Australia Turing Consulting Full time $120,000 - $150,000 per yearRole Title: Senior Data EngineerLocation: Melbourne, Victoria, AustraliaContract Duration: 4 months (extendable)Description:Technical:Pyspark, Python, SparkSQL, SQL and Glue.AWS cloud experienceGood understanding of dimensional modellingGood understanding DevOps, CloudOps, DataOps, CI/CD & with a SRE mindsetUnderstanding of Lakehouse and DW...