Site Reliability Engineer - FedRAMP
We are seeking a Site Reliability Engineer who will improve and maintain software development, test, and live infrastructure and services. You will be articulate and have experience with Linux and other *NIX derivatives. Your primary mission as a cloud engineer is to work with the development, technical operations, quality assurance, and product management teams to ensure the uptime and performance of Trellix's Cloud Security Solutions for corporate clients and US Government agencies.
You will support Trellix's Cloud Security Solution, a set of mission-critical front-end and back-end systems and applications in a production environment that includes US Government-specific deployment and support procedures
You will help us identify and improve infrastructure and system reliability, performance, monitoring, and overall stability of McAfee Cloud Security systems
You will perform capacity planning and demand forecasting to meet system demand, identify performance bottlenecks, and devise tuning improvements
You will build tools and automation that eliminate repetitive tasks and prevent incidents
You will help us create operational runbooks and documentation
You will participate in 24×7 operational support and on-call rotation shifts
You will have 5+ years of experience supporting production applications and systems
You will have experience supporting, analyzing and troubleshooting large-scale distributed mission-critical systems
You will have a systematic approach and strong sense of ownership to bring problems to resolution
You will have experience configuring and managing web servers (Apache, Tomcat, Nginx) and RESTful web service applications
You will have knowledge of Linux systems administration and architecture
You will have experience configuring and managing virtualized environments
Proficiency working with Amazon Web Services (AWS) such as EC2, EBS, ELB, S3, and EMR in a highly available and scalable production environment
You will have experience with continuous integration and deployment automation tools such as Jenkins, Harness, AWS CloudFormation, Salt, Puppet, Chef, or Ansible
You will have experience with SQL databases (MySQL) and other data stores (Redis, Couchbase, Cassandra, Crate)
Experience with open-source technologies (Kafka, Memcached, Redis, Hadoop, HBase, ZooKeeper, Oozie)
Network knowledge (TCP/IP, UDP, DNS, load balancing) and prior network administration experience are a big plus
Extensive scripting experience with Shell, Python or Golang
Experience documenting processes, systems, environments and runbook procedures
Experience with source control tools such as GitHub, SVN, or Perforce
Company Benefits and Perks:
We work hard to embrace diversity and inclusion and encourage everyone to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours and family-friendly benefits to all of our employees.
Pension and Retirement Plans
Medical, Dental and Vision Coverage
Paid Time Off
Paid Parental Leave
Support for Community Involvement
We're serious about our commitment to diversity, which is why we prohibit discrimination based on race, color, religion, gender, national origin, age, disability, veteran status, marital status, pregnancy, gender expression or identity, sexual orientation, or any other legally protected status.
Trellix is a global company redefining the future of cybersecurity. The company's open and native extended detection and response (XDR) platform helps organizations confronted by today's most advanced threats gain confidence in the protection and resilience of their operations. Trellix's security experts, along with an extensive partner ecosystem, accelerate technology innovation through machine learning and automation to empower over 40,000 business and government customers. More at ~~~ .