Friday 1 August 2014

Senior Service Operations Engineer, Monitoring and Measurement | LinkedIn | Mountain View, California


Senior Service Operations Engineer, Monitoring and Measurement

LinkedIn - Mountain View, California

Posted 2 days ago

About this job

Job description

Service Operations Engineer, Monitoring and Measurement - Mountain View, CA

Is monitoring your passion? Do you get a thrill out of catching all potential issues and fixing them before they wreak havoc? Do you dream of working in the best engineering culture around? If so, then LinkedIn wants a word with you. We are looking for a dynamic, seasoned monitoring engineer who will find the best possible method (homegrown or third-party) for monitoring our entire web presence and keeping it up, automate everything that can be automated, perform sound data analysis, guide others on best practices for monitoring, and who has a tendency for scope creep. With hundreds of millions of users and millions of metrics to choose from, it is a monitoring engineer’s playground. Take a look at our blog: https://engineering.linkedin.com/tags/monitoring. Come join us.

LinkedIn seeks a dynamic, seasoned engineer who is passionate about monitoring and measurement technologies and best practices. Experience in 24x7 site operations, strong relationship building with partners as well as within the company, a track record of automation, strong operational discipline, and expertise in monitoring tools and services at both the system and application layers are required. The ideal candidate will combine technical knowledge, project management strength, a metrics-driven analytical posture and a desire to expand scope and responsibility. Knowledge of CDN, DNS, and SSL is a plus.

Responsibilities:
• Drive continual improvement into monitoring/measurement/alerting practices and tools, with an emphasis on the acquisition, visualization and storage of site availability and performance metrics
• Evaluate new tools, make recommendations for implementation, and implement as required
• Serve as primary technical contact, owning the technical relationship with providers
• Leverage existing tools, both third-party and developed in-house, along with API creation and management, to maximize the team’s ability to detect, troubleshoot, and resolve issues while managing cost
• Coordinate short- and long-term initiatives with LinkedIn DevOps teams, as well as partners – prioritizing and driving project closure on all sides
• Coordinate with service owners on best practices for monitoring (check intervals, comprehensiveness, effectiveness, etc.) as well as on other services supported by the team (DNS, CDN, SSL, Cloud, etc.)
• Develop and adhere to processes for internal support protocols
• Monitor performance daily and take action to address open issues
• Participate in on-call rotation for CDN, DNS, and monitoring related escalations

Required Skills & Experience:
• 2 years’ experience with monitoring technology (system configuration/management)
• 2 years’ experience with various tools and providers (Zenoss, Nagios, Keynote, Gomez, etc.) and best practices
• 2 years’ experience with REST and SOAP API development and shell programming, with the ability to provide and consume data
• Working knowledge of Python to build tools and automation
• Working knowledge of HTML and JavaScript
• Knowledge of internet protocols (in practice and by RFC) – especially TCP/IP, HTTP and DNS
• Superb communication skills, both written and verbal
• Excellent planning, prioritization and project/time-management skills – especially in a cross-functional context
• Flexibility to work in a novel, dynamic and extremely fast-paced environment

Desired Skills and Experience

Preferred Skills & Experience:
• Knowledge of CDN and DNS technology and best practices
• Knowledge of SSL procurement, management and implementation processes
• Strong troubleshooting skills spanning code, network, HTTP, DNS, etc.
• Experience running a production consumer 24x7 website at scale
• Bachelor’s Degree in Computer Science


http://www.linkedin.com/jobs2/view/18039016?trk=jserp_job_details_text

