Site Reliability Engineer - Europe
Remote- Fullerton, California, United States
Infrastructure
Job description
Why work for DreamHost?
We help people own their digital presence. It's not just wishful thinking. It's our noble cause — an idea that drives everything we do.
When you put your dreams online—your words, your photos, your creations—you shouldn't have to worry about your service provider mining that data for marketing purposes. Those are your dreams — not ours! DreamHost's open platform gives you the power to share your data and the freedom to control how it's used.
Working for DreamHost you can be a steward of your own career. We invest in our people, we promote from within the organization when possible and we offer extensive training to aid in your success. In this role, we offer the flexibility to work from home and during local business hours. We are proud to be a US employer with a fantastic culture and the ability to offer so much to our employees.
Overview
DreamHost is looking for a Site Reliability Engineer with experience deploying and maintaining large scale Linux environments. This role will be primarily focused on system and platform performance and infrastructure automation. Programming experience is required with at least one high level language such as Python or Perl. Experience is required with deploying Linux software and infrastructure via CI automation, and experience automating configuration through tools such as Ansible. Experience deploying large scale monitoring and datastore platforms with technology such as ELK, Grafana, Redis, Prometheus is a big plus, as well as hands on experience with VM and container platforms such as OpenStack, LXC, Podman, Docker, and Kubernetes. This role will involve building and growing platforms using the tools previously mentioned in order to support the operation of various teams at DreamHost. This role will be working closely with a group of Linux Systems Engineers, Developers, Security Engineers, Network Engineers, and Database Engineers in order to perform and succeed both as a team and individually.
Benefits Offered
- Competitive salary
- WorkTango team member recognition
- Udemy online learning courses
- Opportunities for profit-sharing bonuses
- Fun monthly events
- Opportunities for growth
- Laid-back atmosphere
Responsibilities
Monitor and test system performance across a variety of Linux based infrastructure in an automated manner; prepare and deliver performance statistics and reports.
Design and deploy new internal tools, applications, and operating systems in an automated and easily maintainable fashion.
Modify and improve existing tooling, automation, and processes to improve efficiency, improve performance, and reduce technical debt.
Create and maintain documentation as it relates to system and application configuration and processes.
Gauge the effectiveness and efficiency of existing systems; develop and implement strategies for improving or further leveraging these systems.
Assist in the planning and execution of operating system and software upgrades.
Provide general assistance for Tech Support, Operations, Data Center Operations.
Perform general documentation, classes, training, and diagramming
Conduct research on emerging technologies in support of systems development efforts, and recommend technologies that will increase cost effectiveness, systems flexibility, maintainability, and performance
Provide technical and process guidance to junior members of the team
Ensure compatibility and interoperability of deployed systems
Coordinate and perform in-depth tests, including end-user reviews, for modified and new systems
Conduct research on software and systems products to justify recommendations and to support purchasing efforts
Job requirements
Exceptional analytical, conceptual, and problem-solving abilities
Excellent architecture and technical support documentation skills
Strong interpersonal and consultative skills
Ability to conduct research into emerging technologies and trends, standards, and products as required
Able to prioritize and execute tasks in a high-pressure environment
Experience working in a team-oriented, collaborative environment
Ability to effectively present and communicate information to both technical and non-technical individuals and groups
Ability to define problems, collect data, establish facts, and draw valid conclusions
Ability to interpret an extensive variety of technical instructions in mathematical or diagram form and deal with several abstract and concrete variables
Experience with automation tools, ideally Ansible
Senior level systems operations experience in a large-scale Linux environment
Proven experience in overseeing the design, development, and implementation of software systems, applications, and related products
Proven experience with systems planning, security principles, and general software management best practices
Working technical knowledge of current software protocols, and Internet standards, including TCP/IP, DNS, HTTP
Excellent software troubleshooting experience
Working technical knowledge with Python and Bash
Experience building and consuming APIs
Competence with testing, flowchart, and data mapping tools and procedures
**DreamHost provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.
This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.**
or
All done!
Your application has been successfully submitted!