Remote Senior DevOPS Engineer


HireLatam is a premier recruitment agency that places top Latin American talent in independent contractor roles in US companies. With a proven track record and a commitment to excellence, we're your trusted partner in the pursuit of career success. Our extensive network, personalized approach, and supportive guidance ensure that you're in the best hands to find your next job opportunity.



Job Title: Remote Senior DevOPS Engineer (100% Work From Home)

Location: Remote from Latin America, Mexico City preferred


Position Type: Full-time


Salary: $70,000 - $100,000 USD/year based on experience


Schedule: Monday - Friday, business hours Central Time


Job Overview:

The company was founded by repeat fintech founders and focuses on modernizing payment processing infrastructure in Mexico. They specialize in providing secure, efficient, and user-friendly financial services that businesses rely on daily. As they continue to grow, they seek top talent to join their team and help shape the future of digital payments.


This is a full-time position, with a preference for candidates located in Mexico City, though remote work will be considered. As a Senior DevOps Engineer, you will work on developing cutting-edge payment solutions some of Mexico’s largest businesses depend on daily. You will be responsible for designing, implementing, and managing a robust Kafka-based messaging infrastructure that serves as the communication backbone across critical backend services for our core payment processor.

You will be able to contribute to a fast paced team that regularly ships new products that are used by millions of users. You'll be expected to collaborate closely with the company’s founders, contribute to the architecture and design of our platforms, and ensure the delivery of high-quality, scalable software.


Our Client's Products:

Client Invoicing - “Pay Center”: Their invoicing product that powers some of Mexico’s largest companies.

eCommerce Checkout - “Pay Link”: Their smart 1-click checkout product that reduces friction at the checkout for customers.

Note: You must Include your GitHub link and product examples of your work to be considered.

Responsibilities:

  • Automating the deployment, management, and operations of complex distributed systems with Apache Kafka.
  • Implement tracing and performance observability in high scale distributed microservice architectures.
  • Design and manage scalable, high-throughput, and low-latency Kafka clusters for real-time data streaming between services.
  • Build and maintain infrastructure as code (IaC) for Kafka and related services using Terraform, Ansible, or similar tools.
  • Monitor and optimize Kafka performance, ensuring message reliability and minimal downtime in a high-availability payment environment.
  • Set up and maintain centralized observability systems for logs, metrics, and traces across all services using Prometheus, Grafana, or Datadog.
  • Design and maintain CI/CD pipelines for infrastructure and microservices using tools such as GitHub Actions, and Jenkins.
  • Manage containerized workloads using Docker and Kubernetes, ensuring scalability, and automated rollouts/rollbacks in production
  • Collaborate with backend engineers, SREs, and platform teams to implement Kafka producers/consumers that integrate cleanly with payment processing flows.
  • Establish security, access control, and encryption protocols for Kafka to meet regulatory and compliance standards (e.g., PCI DSS).
  • Lead Kafka upgrades, partition strategy design, and rebalancing without disrupting critical microservices.
  • Implement observability tooling for Kafka (e.g., Confluent Control Center, Prometheus/Grafana, or Datadog integrations).
  • Develop disaster recovery and failover strategies for Kafka-related components in production.
  • Participate in incident response processes for Kafka-related outages.
  • Strong communication skills in both English and Spanish.