Sumo Logic: Site Reliability Engineer

* Role can be remote - from anywhere in Poland

We are a cloud-native SaaS machine data analytics platform, solving complex monitoring problems for DevOps, SecOps and ITOps teams. Our customers, including Airbnb, Twitter, BBC and Toyota, choose our solution because it allows them to easily monitor and optimise their large scale applications.

Our micro services architecture in AWS ingests hundreds of TB daily across many geographic regions. We also have short release cycles and no legacy versions to maintain. We write in Scala and use open-source technologies such as Kafka, Kubernetes and Cassandra.

As a Site Reliability Engineer you will work towards enhancing the reliability of Sumo Logic product. Our customers rely not only on a rich feature set of the product but also on it being always available - often it’s their primary tool for maintaining their own software.

The SRE team is unique in Sumo Logic as it doesn’t own any product service, you will work towards the whole codebase of the Sumo Logic product. You will identify the weakest links in either reliability or performance, research and benchmark possible improvements, and implement solutions in cooperation with the owning teams. You will not focus narrowly, there’s a broad spectrum of topics and projects you might get involved in. You will not operate the software, but create tools for other teams to increase visibility, observability, and scalability of Sumo Logic services.

As a Senior Site Reliability Engineer you will:

Deal with software which processes data at a huge scale
Identify reliability improvement areas based on past evidence of production incidents
Program in Scala
Research, benchmark, optimise and implement solutions aiming at improving the performance and reliability of our product
Work with other teams in Sumo Logic Engineering to increase the observability of their services, share reliability knowledge, automate toil, improve their tooling and replace manual processes

Example projects:

SLI (Service Level Indicator) monitoring
Performance measurements and visualisation tooling (perf, ebpf, flamegraphs)
Configuration as Code
Optimising usage of cloud services (AWS)

You have:

At least BSc in Computer Science or related field
6+ (Senior) / 9+ (Staff) years of professional experience
Good coding skills in any language. Object oriented languages are preferred
Strong troubleshooting skills in complex systems
Ability to rapidly learn new software, frameworks, open source tools and development languages
Strong knowledge of large-scale internet service architecture (e.g. load balancing)
Strong understanding of Unix and TCP/IP fundamentals

Ideally you also have:

Experience with performance, scalability, and reliability issues of 24x7 commercial services
Proficiency with the Amazon AWS ecosystem
Self-driven and being proactive
Configuration and maintenance of common infrastructure such as Apache ZooKeeper, HAProxy
Experience working in a test-driven environment

Why it’s worth applying:

Great salary - employment contract (65% authorship costs).
Strong engineering teams.
Stock (RSU) grant.
$2000 / year education budget + 2 extra days off.
4 extra days off in 2022 (Sumo Wellness Days).
Hack weeks and tech talks.
Private healthcare for you and your family.
Medical and life insurance.
Sports card.
WFH budget.
Lunch budget.
Individual English lessons with a native speaker.
You can work from the office, 100% remotely or in a hybrid model.

#LI-Remote

#LI-AO1

Be sure to mention the word **RECONCILIATION** and tag RMTk1LjIwLjI0MS40OQ== when applying to show you read the job post completely. RMTk1LjIwLjI0MS40OQ==This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

Name	Domain	Expiration	Description	Type
cc_cookie_d1	nomadswork.com	1 Year	Storage of the selection in the cookie layer.	Cookie
_csrf	nomadswork.com	10 minutes	Protection against counterfeiting through cross-website requirements.	Cookie
connect.sid	nomadswork.com	Session	Login session for nomadswork.com	Cookie
hmt_id	hcaptcha.com	1.30 Days	Used for strictly necessary anonymous service-related statistics and for other technical purposes such as availability assistance.	Session
INGRESSCOOKIE, __cfduid, __cflb, session, sessionid	hcaptcha.com	Varies; up to 30 days	Used for strictly necessary technical purposes: load balancing, routing. See further details.	Session
hc_accessibility	hcaptcha.com	Varies; up to 30 days	Used for strictly necessary technical purposes: enables the user to use the accessibility. See further details.	Session
__stripe_mid	stripe.com	Session	Fraud prevention and detection	Cookie
__stripe_sid	m.stripe.com	Session	Fraud prevention and detection	Cookie
m	m.stripe.com	Session	Fraud prevention and detection	Cookie
session	stripe.com	2 months, 29 days	Login session for Stripe Dashboard	Cookie
lsession	stripe.com	7 days	Login session for Stripe Express Dashboard, for Stripe Express users	Cookie
stripe.csrf	stripe.com	1 year	Protection against counterfeiting through cross-website requirements, for users of Stripe Dashboard	Cookie
cliauth_secret	stripe.com	Session	To confirm authentication for the Stripe CLI	Cookie
art_token, cbt_token, cct_token, cdt_token, ect_toksvt_token, lc_token, prt_token, act_token	stripe.com	Session	To confirm authentication for account recovery, bank account changes, login challenges, password resets, support requests, adding an email or a new device	Cookie
NID	stripe.com	Session	Used by reCAPTCHA, an extra security measure that is sometimes used when logging into Stripe.	Cookie
locale	stripe.com	Session	Localization setting for the language used on the website and in the documents	Cookie
country	stripe.com	Session	Localization settings for the country to customize the availability of the product and features	Cookie
lang	stripe.com	Session	Programming language for the code examples in Stripe documents	Cookie
has_intentionally_selected_curl	stripe.com	Session	Displays the code examples in Curl in Stripe documents	Cookie
persisted-tab-#{id}	stripe.com	Session	When the page is updated, it remembers which document tab you are on	Cookie
disable_cmd_f_override	stripe.com	Session	Deactivates the search shortcut cmd + f / ctrl + f for stripe documents and uses the standard behavior of the browser instead (only searches the current page)	Cookie
double_cmd_f_uses	stripe.com	Session	Tracks the use of the shortcut cmd + f / ctrl + f in Stripe documents; to improve usability by not showing the user again a function that he has already used	Cookie
expanded-topics	stripe.com	Session	When page updates are made, remembers which topics are expanded in Stripe documents	Cookie
checkout-test-session, checkout-live-session	stripe.com	Session	To provide the memory function of Legacy Checkout	Cookie
_ga, _gat, _gat_UA-12675062-5, _gid	stripe.com	Session	Google Analytics cookies for analysis and to improve services	Cookie
cid	stripe.com	Session	Stripe analytics "Client ID" to improve services	Cookie
site_sid, __stripe_id	stripe.com	2 hours, 30 minutes	description ...	Cookie
__stripe_orig_props	stripe.com	Session	To assess the effectiveness of marketing campaigns	Cookie
__utma, __utmb, __utmc, __utmt, __utmz	runkit.com	10 minutes	Runkit’s Google Analytics	Cookie
_mkto_trk	marketo munchkin	Session	Tracks page views and the effectiveness of email campaigns	Cookie
muc	twitter	Session	Stripe Atlas Twitter Marketing Campaigns	Cookie
_fbp	facebook.com	Session	Facebook advertising	Cookie
fr	facebook.com	Session	Facebook advertising	Cookie
bcookie, bscookie, lang, Li_sugr, lidc, UserMatchHistory	linkedin.com	Session	LinkedIn advertising	Cookie
IDE	google.com	Session	Google advertising	Cookie
Lidc, Li_sugr	linkedin.com	Session	LinkedIn Insights Tag for Marketing Solutions	Cookie

Name	Domain	Expiration	Description	Type
__tawkuuid	tawk.to	10 years, 2 days	This cookie is placed when using the customer support chat.	Cookie
TawkConnectionTime	tawk.to	Session	This cookie measures the time spent on the Website	Cookie
twk_60e84b5f649e0a0a5ccb6065	tawk.to	10 Jahre, 2 Tage	This cookie is placed when using the customer support chat.	Cookie

Posted 30 Jun

Site Reliability Engineer at Sumo Logic

Data Collection Participant Photo Submission Task

CrowdGen by Appen

Aprio PH FP&A Analyst

Aprio

Government Relations Officer

Injective Labs

Posted 30 Jun

Site Reliability Engineer at Sumo Logic

Similar Job Offers

Data Collection Participant Photo Submission Task

CrowdGen by Appen

Aprio PH FP&amp;A Analyst

Aprio

Government Relations Officer

Injective Labs

Sign up and get weekly remote other job offers to your mailbox.

Aprio PH FP&A Analyst

Sign up and get weekly
remote other job offers
to your mailbox.