Ciklum is looking for a Site Reliability Engineer to join our team full-time in Ukraine.
We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants, analysts and product owners, we engineer technology that redefines industries and shapes the way people live.
About the role:
As a Site Reliability Engineer, become a part of a cross-functional development team engineering experiences of tomorrow.
Our Client is the world’s leading social trading network, providing millions of users around the world with a one-stop-shop solution for their trading and investing needs. Since 2007, our Client has positioned itself as a Fintech leader, pioneering revolutionary practices such as social trading and machine-learning-powered investment products.
The Client’s award-winning trading and investing platform is at the forefront of Fintech.
Our Research and Development department develops cutting-edge technologies, focusing on field-relevant areas like blockchain and artificial intelligence. We promote an organizational culture that is both professional and fun.
The Client’s R&D is currently seeking an ambitious SRE Infrastructure Engineer to join our growing R&D team.
The role responsibilities include the development of features, tools, and observability infrastructure.
You will work collaboratively with professional developers, software architects, and SREs in a fast-paced, agile environment.
Responsibilities:
* Own and evolve observability systems, ensuring platform resilience, reliability, and full visibility across all environments
* Provide technical leadership, defining the architecture and standards for hands-on observability solutions
* Drive performance-health initiatives by establishing SLIs/SLOs, identifying bottlenecks, and maintaining service reliability
* Manage vulnerability tracking and remediation for observability tools and infrastructure, ensuring compliance and timely patching
* Build and maintain self-service capabilities that let R&D teams configure dashboards, alerts, and metrics independently
* Collaborate with Product, Security, and Infrastructure teams to align observability strategy and integrations
* Implement and automate CI/CD processes for observability components and tool maintenance
* Work directly with modern technologies such as K8S, Prometheus, Grafana, Coralogix, DataDog, Splunk and Azure
* Set and enforce observability coding practices, review contributions, and provide constructive technical feedback
* Be the primary contact for observability best practices and platform reliability
Requirements:
* 2 years of technical hands-on experience in SRE/Monitoring roles with deep knowledge of observability tooling and principles
* Proven expertise with metrics/logs-based monitoring and alerting (Prometheus, Grafana, DataDog, Coralogix, OpenTelemetry, etc.)
* Strong understanding of cloud environments (preferably Azure) and container orchestration (K8S)
* Experience with vulnerability management and coordination of security fixes for infrastructure and observability stacks
* Demonstrated success creating self-service observability solutions for development teams
* Solid background in automation and CI/CD pipelines for monitoring and logging systems
* Experience leading cross-functional reliability or performance projects and defining SLIs/SLOs
* Excellent communication and leadership skills for mentoring engineers and driving cross-team alignment
* Analytical, detail-oriented mindset with ownership of system uptime and performance
Personal skills:
* Fast learner
* An eye for detail, strong logic, and analytical skills
* Ability to think critically and work under pressure to resolve incidents and troubleshoot complex systems
* Ability to mentor junior engineers and share knowledge
* Strong spoken and written English
* Excellent communication skills, both written and verbal
* Ability to collaborate with both technical and non-technical teams
* A collaborative, problem-solving mindset with a passion for improving systems and delivering reliability at scale
What’s in it for you?
* Strong community: Work alongside top professionals in a friendly, open-door environment
* Growth focus: Take on large-scale projects with a global impact and expand your expertise
* Tailored learning: Boost your skills with internal events (meetups, conferences, workshops), Udemy access, language courses, and company-paid certifications
* Endless opportunities: Explore diverse domains through internal mobility, finding the best fit to gain hands-on experience with cutting-edge technologies
* Flexibility: Enjoy radical flexibility: work remotely or from an office, your choice
* Care: We’ve got you covered with company-paid medical insurance, mental health support, and financial & legal consultations
About us:
At Ciklum, we are always exploring innovations, empowering each other to achieve more, and engineering solutions that matter. With us, you’ll work with cutting-edge technologies, contribute to impactful projects, and be part of a One Team culture that values collaboration and progress.
As one of Ukraine’s largest IT companies and a top employer recognized by Forbes, we’ve spent over 20 years delivering meaningful tech solutions. We proudly support diverse talent and military veterans, recognizing the unique skills and perspectives they bring to shaping the future.
Want to learn more about us? Follow us on Instagram, Facebook, and LinkedIn.
Explore, empower, engineer with Ciklum!
Interested already? We would love to get to know you! Submit your application. Can’t wait to see you at Ciklum.