Site Reliability Engineer (SRE)

Ankara • Technology / Technology Team • Tam Zamanlı

As Padran Information Technologies, we are looking for teammates who are focused on growth and success! In this position, you will work for our clients that we provide consultancy and you will take part in projects in Turkey's leading companies.


We are looking for a SRE who meets the following qualifications for our consulting business partner.



Requirements 

 

·      Hold a Bachelor of Science (BSc) degree in Computer Engineering or a related field.

·      Minimum 3 years of relevant experience in Platform Engineering, SRE, and/or DevOps in production environments.

·      Possess at least 2 years of experience with scripting languages (Bash, Python, NodeJS, Ruby, or PHP) and related automation projects.

·      Proficiency in “Infrastructure-as-Code” tools such as CloudFormation, Terraform, Chef, Ansible, and Puppet.

·      Experience in building distributed, failure-resistant architectures, including disaster recovery, backups, failover, etc.

·      Demonstrate operational understanding of monitoring systems and time-series databases (e.g., Prometheus, Grafana, Elasticsearch).

·      Proficient in setting up CI/CD pipelines and deployment tools (e.g., Jenkins, Git, GitHub, AWS Developer tools).

·      Strong spoken and written English communication skills.

·      Self-driven, responsible, eager to learn, and proactive.

·      Independent, goal-oriented, and proactive attitude.

·      Disciplined and effective in remote work environments.




What you will do

 

·      Define and monitor SLOs and SLIs for critical services to ensure they meet performance and reliability targets. 

·      Regularly review and adjust these metrics as necessary.

·      Lead and participate in incident response activities, including identifying, investigating, and resolving incidents to minimize impact on service availability and performance.

·      Conduct post-incident reviews (postmortems) to identify root causes and implement preventative measures.

·      Analyze system performance metrics and forecast capacity requirements to ensure adequate resources are available to support current and future workloads.

·      Identify opportunities for performance optimization and efficiency improvements.

·      Continuously evaluate and improve processes, tools, and infrastructure to enhance reliability, efficiency, and scalability.

·      Stay up-to-date with industry trends, emerging technologies, and best practices, and drive innovation within the organization.

·      Monitor system health and performance using monitoring tools and alerting systems and respond promptly to alerts and incidents.

·      Drive efficiency by automating repetitive tasks and processes.

·      Evaluate and implement technology options for managing our enterprise products both on-cloud and on-premise.

·      Enhance our platform by identifying areas for improvement based on monitoring data.

·      Ensure robust security practices by leveraging industry best practices and available tools.

·      Regularly assess and enhance security measures.

·      Collaborate with security teams to implement and maintain compliance standards.

·      Be the go-to expert for our platform services.

·      Work closely with the development team to create a development environment that fosters productivity and innovation.

·      Propose and drive adoption of new solutions that enhance our platform.

·      Diagnose and resolve complex system and application issues promptly.



Why Padran Information Technologies?


- Opportunity to work with leading companies in Turkey

- Opportunity to use industry-leading technologies with our business partners Microsoft, IBM, AWS and Open Text

- Career development and certification opportunities as an ISTQB accredited training center