Andrew Tirto Kusumo

Senior Data Engineer

Building data infrastructure that scales.

About

Data Engineer with 7+ years of experience building scalable data pipelines, streaming architectures, and analytics platforms across fintech companies. Passionate about making teams work faster and better through reliable data infrastructure.

Languages

Python, SQL, Scala, Java

Big Data

Apache Spark, Kafka, Flink, Hadoop

Orchestration

Apache Airflow, Dagster, Prefect

Cloud

AWS (Glue, Redshift, S3, EMR), GCP (BigQuery, Dataflow), Azure

Data Modeling

dbt, Data Vault, Star Schema

Databases

PostgreSQL, MySQL, MongoDB, Cassandra, DynamoDB

DevOps / Infra

Docker, Kubernetes, Terraform, CI/CD

Other

Git, Linux, REST APIs, GraphQL

Experience

Funding Societies

Senior Data Engineer

Nov 2024 - Present

Airflow, AWS, Snowflake, Qlik, Python, Docker
  • Leading the Finance & Risk DE team, delivering reports for Finance Closing, FP&A, ECL, and regulatory reporting
  • Migrating legacy pipelines to a more sustainable approach using ECS
  • Handling several high-priority projects with external partners

Paper.id

Senior Data Engineer

May 2024 - Oct 2024

Airflow, dbt, BigQuery, ArangoDB, Datastream, Pub/Sub, Python, Docker
  • Built a streaming pipeline from scratch using Google Datastream, Pub/Sub, and Dataflow to ingest data from App DB to BigQuery
  • Fixed inefficiencies in the existing dbt ELT workflow, roughly doubling development speed
  • Reduced BigQuery costs by ~20% per month through targeted optimization
  • Created a cost management dashboard tracking project-level spend daily

Flip.id

Data Engineer Manager

Dec 2022 - May 2024

Senior Data Engineer

Sep 2021 - Dec 2022

dbt, BigQuery, Dataflow, Datastream, Pub/Sub, Python, Docker, GitLab CI
  • Built a streaming pipeline from scratch using Google Datastream, Pub/Sub, and Dataflow
  • Created end-to-end ELT pipelines with dbt, implementing tests and query dependencies
  • Reduced BigQuery costs ~20% per month through strict partitioning and clustering
  • Built and managed the Data Engineer team from zero — hiring, career framework, and processes
  • Developed a credit scoring POC for a new lending product with Docker and FastAPI
  • Provisioned Redash and Looker Studio dashboards for analysts and end users

JULO

Senior Data Engineer

Jan 2021 - Sep 2021

Data Engineer

Aug 2018 - Jan 2021

Airflow, AWS, GCP, Spark, PostgreSQL, Docker, CircleCI, Ansible
  • Managed and maintained Airflow data pipelines running 24/7
  • Set up streaming replicas of the master DB to serve analytics workloads
  • Deployed ML models using Docker and H2O with feature implementation in Django
  • Designed and implemented PostgreSQL 10 range partitioning for large tables
  • Built an action log data archiver for a DB with nearly a billion rows
  • Integrated CircleCI for automated testing and deployment

Projects

Streaming Pipeline

Built a real-time streaming pipeline from scratch to ingest data from application databases to Google BigQuery using Google Datastream, Pub/Sub, and Dataflow.

Datastream, Pub/Sub, Dataflow, BigQuery, Python

KUACI — Open Source KYC

Created an open-source KYC tool for Indonesian KTP that translates ID numbers into location, gender, and date of birth. Contributed to the GitHub Arctic Code Vault.

Python, Open Source, GitHub
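The idea behind translating a KTP number can be illustrated with a minimal sketch. This is not KUACI's actual code; it assumes the standard 16-digit NIK layout (six digits of region codes, six digits of birth date as DDMMYY with 40 added to the day for women, and a four-digit serial). The names `NikInfo`, `parse_nik`, and `pivot_year` are hypothetical, and the sample number is made up. Resolving the region codes to place names would need a separate lookup table, which is left out here.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class NikInfo:
    province_code: str   # digits 1-2
    regency_code: str    # digits 3-4
    district_code: str   # digits 5-6
    gender: str          # "male" or "female"
    birth_date: date


def parse_nik(nik: str, pivot_year: int = 2025) -> NikInfo:
    """Split a 16-digit NIK into region codes, gender, and birth date.

    pivot_year is an assumption used to expand the two-digit year:
    a year that would land after pivot_year is treated as 19YY.
    """
    if len(nik) != 16 or not (nik.isascii() and nik.isdigit()):
        raise ValueError("NIK must be exactly 16 ASCII digits")

    day = int(nik[6:8])
    month = int(nik[8:10])
    yy = int(nik[10:12])

    # Women are encoded with 40 added to the day of birth.
    gender = "female" if day > 40 else "male"
    if day > 40:
        day -= 40

    century = 2000 if 2000 + yy <= pivot_year else 1900
    # date() raises ValueError for impossible dates such as month 13.
    birth_date = date(century + yy, month, day)

    return NikInfo(nik[0:2], nik[2:4], nik[4:6], gender, birth_date)


# Made-up sample number, for illustration only.
info = parse_nik("3174015708950001")
print(info.gender, info.birth_date.isoformat())
```

The two-digit year is inherently ambiguous, so the pivot is a design choice rather than part of the NIK format; a caller who knows the population's age range can pass a different `pivot_year`.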

BigQuery Cost Optimization

Identified and resolved BigQuery cost inefficiencies through strict partitioning, clustering, and pipeline optimization — reducing monthly costs by ~20%.

BigQuery, dbt, SQL, Cost Management

DE Team from Zero

Built and managed the Data Engineer team at Flip.id from scratch — established hiring pipelines, defined career frameworks, and set up engineering processes to scale the team.

Leadership, Hiring, Process Design, Mentoring

Get In Touch

Have a question or want to work together? Feel free to reach out.