Senior DevOps Engineer

Bogotá D.C., Bogotá DC
Publicado hace 7 días
Logo Torre
Empresa:
Torre
Descripción de la Empresa:
Ayudamos a postear ofertas de trabajo de diferentes empresas que trabajan con nosotros para un mejor y rápido flujo.
Tipo de Contrato:
Tiempo Completo
Experiencia requerida:
1 año
Educación requerida:
Universitaria
Género:
Indiferente
Cantidad de Vacantes:
1

Descripción de la Oferta de Trabajo

We're helping one of our clients, Web Shop Manager, hire for a Senior DevOps Engineer.
“Empowering eCommerce success globally with advanced product data expertise and superior customer experience!.”

Compensation: To be agreed upon.

Location: Remote (for México, Argentina, Brazil and Colombia residents).

Skills: AI (Artificial intelligence), English, RAG (Retrieval-augmented generation) and GCP (Google Cloud Platform).

Responsibilities and more:

We are seeking a full-stack developer ready to work with new technologies and architectures in a forward-thinking organization that constantly pushes boundaries.
This role involves developing and maintaining a high-performing, fully automated QA and CI/CD pipeline. We release new versions of our platform every week, with our application infrastructure running on AWS and primarily orchestrated by Terraform and EC2 instances.

We need a Senior DevOps Engineer with experience on the AWS platform to complete our Site Reliability/DevOps team. This individual will lead the research and development of architectural changes from the infrastructure up. They will take ownership of our AWS, CloudFormation, Terraform, and CI/CD pipeline, collaborating with our software development team to design and build improvements that help us scale reliably as we expand our customer base. Additionally, this role involves mentoring other engineers to design and implement scalable, highly reliable software. You will actively refactor existing system and application code in this hands-on position.

Infrastructure as Code (IaC): 

- Collaborate with development teams to design, implement, and maintain infrastructure using tools like Terraform, or CloudFormation.
- Automate provisioning, configuration, and scaling of cloud resources.

Monitoring and Incident Response:

- Set up monitoring and alerting systems (e.g., Datadog, OpenTelemetry, NewRelic, Prometheus, OpsGenie, StatusPage) to track service-level indicators (SLIs) and respond to incidents promptly.
- Participate in on-call rotations, diagnosing and resolving production issues.

Capacity Planning and Performance Optimization:

- Analyze system performance, identify bottlenecks, and optimize resource utilization.
- Work closely with developers to improve application performance.

Reliability Engineering:

- Define and track service-level objectives (SLOs).
- Implement chaos engineering practices (e.g., game days, fault injection) to validate system resilience.
- Implement load testing on services (K6 Grafana, Artillery).

Continuous Integration and Deployment (CI/CD):

- Enhance CI/CD pipelines, ensuring smooth and reliable software releases.
- Implement blue-green deployments, canary releases, and feature flags.

Security and Compliance:

- Collaborate with security teams to ensure compliance with industry standards (e.g., CIS, NIST, SOC, ISO).
- Implement security best practices, including access controls, encryption, and vulnerability scanning.

Documentation and Knowledge Sharing:

- Document infrastructure, processes, and incident response procedures.
- Share knowledge with team members through internal workshops or presentations.

Required skills and qualification:

- Education: Bachelor’s degree in computer science, software engineering, or a related field. Master’s degree preferred.
- Experience: Minimum of 5 years in DevOps or SRE roles.
- Proficiency in cloud platforms (AWS, GCP) and container orchestration (Kubernetes, Docker).
- Proficiency with Terraform and Cloudformation.

Programming and Scripting:

- Strong scripting skills (Python, Bash, or similar).
- Expert with Git, Github (actions) and version control.

Collaboration and Communication:

- Excellent teamwork and communication skills.
- Ability to work across cross-functional teams.

Problem-Solving and Analytical Skills:

- Proven ability to troubleshoot complex issues and optimize system performance.

Certifications (optional):

- AWS Certified DevOps Engineer, Google Professional DevOps Engineer, or similar.
- Ability to write and maintain clear and concise technical documentation.
- Experience in search technologies such as full-text lexical search and/or semantic vector-based search.
- Strong understanding and experience with CI/CD pipelines.
- Experience with ML, ML embeddings and generative AI.
- Experience with Retrieval-Augmented Generation (RAG).
- Experience working with AWS Bedrock.
- Experience in building web applications.
- Experience in designing and building infrastructure for RESTful and GraphQL APIs.
- Experience with data lakes, and processing large datasets and working with Big Data and Analytics.
- Knowledge of Ruby, Python, Javascript (Node.JS, React, Typescript, ES6, Cypress), Go, Rust, Elixir.
- Experience working with data processing frameworks and services such as Apache Spark and Airflow.
- Experience with databases such as PostgreSQL, ElasticSearch, MongoDB, influxDB, Milvus, and Quadrant.
- Experience with Messaging Queues such as RabbitMQ, AWS SQS and Kafka.
- Excellent debugging and optimization skills.
- Experience in unit/integration testing.
- An octopus called Dave.

If you are a talented DevOps Engineer with a passion for creating innovative solutions and thriving in a fast-paced startup environment, we would love to hear from you.

Empleos relacionados a Senior DevOps Engineer