We’re looking for a skilled engineer to manage and optimize Amazon RDS MySQL for high write-intensive workloads (up to 50,000 QPS). You’ll be responsible for configuring real-time data streaming with Debezium and Kafka, and for designing effective strategies for replication, sharding, indexing, and partitioning.
Your role will also include analyzing and tuning SQL query performance, ensuring high system availability, and maintaining reliable backup and disaster recovery mechanisms. You’ll integrate datasets with analytical platforms such as ClickHouse and Snowflake, and leverage Infrastructure as Code (Terraform or AWS CDK) to automate deployments. Schema versioning and deployments will be managed through Git and CI/CD pipelines. What You’ll Bring * Proven experience managing MySQL RDS under heavy production workloads. * Deep understanding of binary log replication and change data capture (CDC) using Debezium. * Strong expertise in query performance tuning, indexing strategies, and slow query analysis. * Proficiency in Bash or Python scripting. * Familiarity with Git-based workflows and CI/CD practices. * Solid grasp of data pipelines connecting to ClickHouse and Snowflake.
Nice to Have * Experience working with AWS Glue or Apache Airflow for data pipeline orchestration. * Knowledge of ClickHouse ingestion best practices. * Familiarity with monitoring tools like Prometheus and CloudWatch.