RudderStack: Site Reliability Engineer II

*Our roles are remote first, and can be based anywhere in India (#LI-Remote).

Responsibilities

Monitor and continually improve the capacity of our production environment
Design and implement scalable, reliable, and efficient infrastructure using Kubernetes, Terraform, AWS resources.
Partner with development teams to improve services through rigorous testing and release procedures with CI pipelines (Github Actions, Dockerfiles)
Gain a deeper understanding of RudderStack infrastructure and help debug incidents
Proactively build software to help operations and support teams
Identify opportunities for process improvements, automation, and cost savings

Requirements

A Bachelor or Master degree in Computer Science or equivalent experience is required
5+ years of experience as a Site Reliability Engineer, Internal Platform Developer or similar role
Strong understanding of cloud computing, containers, and DevOps practices
Demonstrated Linux experience
Excellent debugging skills
Experience with Scripting and infrastructure automation
Familiarity with distributed systems design patterns using tools such as Kubernetes
Familiarity with AWS, Azure or Google Cloud Compute
Excellent verbal and written communication skills
Familiarity with Networking concepts like VPCs, proxies and CDNs

Here are examples of things we've worked on:

Build and maintain a Kubernetes platform to deploy all our applications with high availability
Build Kubernetes operator to automate 100s of deployments
Managed 100s of postgres with HA for our deployments
Provision and manage air-gapped on-premise deployments in diverse environments.
Manage multi-region multi-cluster environment with hundreds of customer deployments in single-tenant and multi-tenant models.
Complete Infrastructure as a code and enforced using GitOps model
Automated migrations of complex, highly available services
Working on compliance(i.e. SOC2 Type 2, HIPPA), security, scalability, and a lot more aspects to deliver top class, secure software
We follow FinOps and continuously optimize our cloud costs.

How we achieve results:

Empathy for the problems encountered by our customers.
Collaboration with engineering teams to achieve results.
Care deeply about the quality of your and the team's code
Curiosity and understanding, for investigating causes and finding effective solutions.
Output driven to provide value to our customers in a significant, measurable, and positive way.
Focus on writing testable, performant, bug-free code to provide the right solutions to the problems.

Please mention the word **SHARPEST** and tag RMjA5LjIyMi4yMS42Mg== when applying to show you read the job post completely (#RMjA5LjIyMi4yMS42Mg==). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

Name	Domain	Expiration	Description	Type
cc_cookie_d1	nomadswork.com	1 Year	Storage of the selection in the cookie layer.	Cookie
_csrf	nomadswork.com	10 minutes	Protection against counterfeiting through cross-website requirements.	Cookie
connect.sid	nomadswork.com	Session	Login session for nomadswork.com	Cookie
hmt_id	hcaptcha.com	1.30 Days	Used for strictly necessary anonymous service-related statistics and for other technical purposes such as availability assistance.	Session
INGRESSCOOKIE, __cfduid, __cflb, session, sessionid	hcaptcha.com	Varies; up to 30 days	Used for strictly necessary technical purposes: load balancing, routing. See further details.	Session
hc_accessibility	hcaptcha.com	Varies; up to 30 days	Used for strictly necessary technical purposes: enables the user to use the accessibility. See further details.	Session
__stripe_mid	stripe.com	Session	Fraud prevention and detection	Cookie
__stripe_sid	m.stripe.com	Session	Fraud prevention and detection	Cookie
m	m.stripe.com	Session	Fraud prevention and detection	Cookie
session	stripe.com	2 months, 29 days	Login session for Stripe Dashboard	Cookie
lsession	stripe.com	7 days	Login session for Stripe Express Dashboard, for Stripe Express users	Cookie
stripe.csrf	stripe.com	1 year	Protection against counterfeiting through cross-website requirements, for users of Stripe Dashboard	Cookie
cliauth_secret	stripe.com	Session	To confirm authentication for the Stripe CLI	Cookie
art_token, cbt_token, cct_token, cdt_token, ect_toksvt_token, lc_token, prt_token, act_token	stripe.com	Session	To confirm authentication for account recovery, bank account changes, login challenges, password resets, support requests, adding an email or a new device	Cookie
NID	stripe.com	Session	Used by reCAPTCHA, an extra security measure that is sometimes used when logging into Stripe.	Cookie
locale	stripe.com	Session	Localization setting for the language used on the website and in the documents	Cookie
country	stripe.com	Session	Localization settings for the country to customize the availability of the product and features	Cookie
lang	stripe.com	Session	Programming language for the code examples in Stripe documents	Cookie
has_intentionally_selected_curl	stripe.com	Session	Displays the code examples in Curl in Stripe documents	Cookie
persisted-tab-#{id}	stripe.com	Session	When the page is updated, it remembers which document tab you are on	Cookie
disable_cmd_f_override	stripe.com	Session	Deactivates the search shortcut cmd + f / ctrl + f for stripe documents and uses the standard behavior of the browser instead (only searches the current page)	Cookie
double_cmd_f_uses	stripe.com	Session	Tracks the use of the shortcut cmd + f / ctrl + f in Stripe documents; to improve usability by not showing the user again a function that he has already used	Cookie
expanded-topics	stripe.com	Session	When page updates are made, remembers which topics are expanded in Stripe documents	Cookie
checkout-test-session, checkout-live-session	stripe.com	Session	To provide the memory function of Legacy Checkout	Cookie
_ga, _gat, _gat_UA-12675062-5, _gid	stripe.com	Session	Google Analytics cookies for analysis and to improve services	Cookie
cid	stripe.com	Session	Stripe analytics "Client ID" to improve services	Cookie
site_sid, __stripe_id	stripe.com	2 hours, 30 minutes	description ...	Cookie
__stripe_orig_props	stripe.com	Session	To assess the effectiveness of marketing campaigns	Cookie
__utma, __utmb, __utmc, __utmt, __utmz	runkit.com	10 minutes	Runkit’s Google Analytics	Cookie
_mkto_trk	marketo munchkin	Session	Tracks page views and the effectiveness of email campaigns	Cookie
muc	twitter	Session	Stripe Atlas Twitter Marketing Campaigns	Cookie
_fbp	facebook.com	Session	Facebook advertising	Cookie
fr	facebook.com	Session	Facebook advertising	Cookie
bcookie, bscookie, lang, Li_sugr, lidc, UserMatchHistory	linkedin.com	Session	LinkedIn advertising	Cookie
IDE	google.com	Session	Google advertising	Cookie
Lidc, Li_sugr	linkedin.com	Session	LinkedIn Insights Tag for Marketing Solutions	Cookie

Name	Domain	Expiration	Description	Type
__tawkuuid	tawk.to	10 years, 2 days	This cookie is placed when using the customer support chat.	Cookie
TawkConnectionTime	tawk.to	Session	This cookie measures the time spent on the Website	Cookie
twk_60e84b5f649e0a0a5ccb6065	tawk.to	10 Jahre, 2 Tage	This cookie is placed when using the customer support chat.	Cookie

Posted 4 Jun

Site Reliability Engineer II at RudderStack

Manager Site Reliability & DevOps

Endpoint Clinical

Job 22929 Developer .Net and React Pleno Brasil

CI&T

$1000 Weekly Work from Home Personal Assistant

MasterCraft

Posted 4 Jun

Site Reliability Engineer II at RudderStack

Similar Job Offers

Manager Site Reliability &amp; DevOps

Endpoint Clinical

Job 22929 Developer .Net and React Pleno Brasil

CI&amp;T

$1000 Weekly Work from Home Personal Assistant

MasterCraft

Sign up and get weekly remote devops & admin job offers to your mailbox.

Manager Site Reliability & DevOps

CI&T

Sign up and get weekly
remote devops & admin job offers
to your mailbox.