As a Site Reliability Engineer (SRE) with a focus on Application Support, you will be responsible for ensuring the stability, performance, and continuous improvement of a complex ecosystem in the retail industry.
Location : Lisbon / Porto (Hybrid)
Availability : ASAP
Responsibilities :
- Providing technical support to internal and external teams
- Gathering data and troubleshooting integration issues between applications
- Monitoring Squad systems and proactively identifying anomalies
- Responding to incidents and performing root cause analysis to prevent recurrence
- Collaborating across technical and business teams
- Managing multiple concurrent issues and helping prioritize with the team
- Participating in on-call rotations
- Communicating effectively with non-technical stakeholders
Required Skills :
3 years of experience in Application Support or SRE rolesSolid understanding of ELK Stack (Elasticsearch, Logstash, Kibana), Prometheus, Grafana, and AWS (Amazon Web Services)Familiarity with collaborative platforms : Jira, Confluence, GitLabAnalytical mindset : distinguish errors from systemic problemsExperience with operationalizing microservicesCapacity to read logs and monitor metrics to identify and act on issuesProactive attitude in suggesting improvements (logs, metrics, alarms, tools)Fluent Portuguese and English (spoken & written)Interested? Send your CV and daily rate to or apply directly.