Posted 26 Apr 2024, 10:00 am
Senior Site Reliability Engineer at Luxury Presence
About Us
Luxury Presence is the fastest-growing digital platform for real estate agents, teams, and brokerages. Our award-winning real estate websites, modern marketing solutions, and AI-powered mobile platform help agents attract more business, work more efficiently, and serve their clients. Since launching in 2016, Luxury Presence has been trusted by more than 11,000 real estate professionals, including over 20 Wall Street Journal Top 100 agents.
Position
Team: Engineering / Infrastructure
Title: Sr. Site Reliability Engineer
Location: Latin America, Remote
Who is the Infrastructure Squad?
The infrastructure squad (internally known as SWAT Squad) is responsible for all cloud resources' reliability, scalability, and performance. We provide the foundation upon which Luxury Presence’s products are built. SREs lead the design, operation, and maintenance of our AWS cloud infrastructure, while also offering Engineering teams valuable frameworks, pipelines, and tooling to streamline production application design, deployment, and management.
What will you do as a Sr. Site Reliability Engineer (SRE)?
As an SRE, your role will involve collaborating with Infrastructure and Product Delivery teams, utilizing tooling and automation to enhance our Kubernetes clusters' readiness, improve observability, scale the system, optimize performance, and promote operational excellence practices.
• Identify and lead strategic initiatives that streamline platform operations, optimize platform performance, and improve system health.
• Design, build, test and maintain scalable and reliable infrastructure that supports workloads for all other teams.
• Manage and optimize Kubernetes clusters on AWS, ensuring high availability, scalability, and efficient resource utilization.
• Automate application deployments, continuous delivery, and the monitoring of changes to Kubernetes manifests using ArgoCD.
• Continuously improve our GitOps foundation with IaC tools like Terraform and Crossplane.
• Improve monitoring and observability to gain better insights into the platform's health, performance, and potential issues.
• Assist in resolving incidents, identifying root causes, and ensuring timely responses to critical issues affecting the platform.
• Analyze and optimize system performance, identifying and resolving bottlenecks in applications or infrastructure components.
• Develop and maintain automation scripts, tools, and workflows to streamline tasks and enhance efficiency.
• Document configurations, processes, and best practices, and share knowledge with the Engineering team to foster collaboration and learning.
• Ensure the platform adheres to security best practices and compliance requirements, implementing the necessary measures.
- 6+ years of combined professional experience as a Software Engineer and SRE.
- At least 3 years of professional experience as an SRE or DevOps Engineer.
- Expert-level knowledge in containerized environments for production workloads utilizing Kubernetes or similar container orchestration systems.
- MUST have working experience with cloud-native infrastructure such as AWS or GCP (ideally AWS).
- MUST understand AWS VPC, subnets, Network ACLs, Security Groups, IAM roles, and EKS.
- Experience configuring Kubernetes RBAC Authorization, Ingress controller, ServiceAccount, and AWS role annotations.
- Experience in automating releases, continuous integration/delivery systems, and relevant tools (e.g., Jenkins, CircleCI, GitHub Actions, GitLab CI).
- Experience with infrastructure as code (ideally Terraform or Crossplane)
- Professional experience with monitoring and observability systems such as Datadog and Prometheus.
- Ability to triage and resolve incidents, and lead incident investigations
- Professional experience with security practices, credential rotations, secrets management systems (ideally Vault).
- Hands-on experience setting up log aggregators such as Fluent Bit, Fluentd, or Logstash. Experience setting up platforms like the ELK stack or hybrids is a plus.
- Working knowledge of GitOps, FluxCD, or ArgoCD is a plus
- Experience building a Kubernetes Operator is a plus.
- Terraform / ArgoCD / Crossplane.io for resource management.
- AWS as our Cloud Provider.
- Kubernetes through AWS EKS as our container orchestrator.
- Argo Events and Argo Workflows for internal pipelines.
- Vault project for secrets and password management.
- Prometheus and Datadog for monitoring and observability.
Who we are: Luxury Presence is the real estate industry's most powerful marketing platform, providing award-winning websites and cutting-edge tech to the world’s top agents.
Founded in 2016 by Stanford Business School alumnus Malte Kramer, Luxury Presence currently serves over 9,000 clients in the U.S. and Canada with its SaaS model, including over 20 of the top 100 WSJ real estate agents and teams. In addition, Luxury Presence is the official website partner to some of the industry's most powerful brokerages.
The Los Angeles-based SaaS company raised $25.9 million for its Series B round and recently announced a $19.2M Series B-1. Bessemer Venture Partners led the round alongside fellow existing investors Toba Capital and Switch Ventures. Former Dallas Mavericks basketball player Dirk Nowitzki also participated in the round, along with other angel investors.
Its solutions include stunning website design, an engaging home search tool, an agent-to-agent listing referral network, powerful content & SEO strategies, expert-led social media management, and digital advertising for lead generation. In 2020, Luxury Presence was recognized as a Best Place to Work by BuiltinLA and ranked by Inc. as the 322nd fastest-growing private company in America; in 2021 it made the Inc. list again, ranking 598th.
Luxury Presence is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin.
Source: Remote Ok