Informatica
Principal Site Reliability Engineer (SRE) (Project Management)
Our Team
Informatica Cloud is a multi-tenant hosted service that provides both the industry leading Integration as a Service (IaaS) platform and integration applications that run on the platform. Since its inception, Informatica Cloud has won many industry awards and currently runs over 200,000 integration tasks that transfer over 1 Billion transactions per day, and you would be a key contributor to this high performing team.
Your Opportunity
You are a seasoned SRE Lead that is interested in joining the Informatica Cloud Business Unit to shape the direction of Informatica's Cloud products and play a key role in defining the bar for the functionality, performance, scalability and reliability for the multi-tenant Cloud based platform and service.
Informatica Cloud (www.informaticacloud.com) is the award-winning integration as a service platform and services that were created specifically to address the data integration needs of the new generation of cloud based software applications such as Salesforce, Facebook, Twitter etc. It is a hosted multi-tenant cloud-based service that delivers purpose-built data integration applications that allow business users to integrate data across cloud-based applications and on-premise systems and databases. Its unique and complete platform APIs allow users, ISVs and SIs to extend the existing services and create entirely new services that address specific integration needs of a vertical market or an ISV application. Informatica Cloud runs over 200,000 integration jobs daily and moves over 1 billion transactions daily (current statistics are accessible at http://trust.informaticacloud.com/status) and has a thriving community of users accessible at https://community.informatica.com/community/products/informatica_cloud). The service has won numerous industry awards including Best of Salesforce AppExchange which it has won for the last four years in a row.
Our Ideal Candidate
You are an expert in the area of large-scale distributed Cloud systems and have proven experience in building highly scalable, highly available web based enterprise class products and see cloud as the most effective way to deliver software applications and services.
You are excited by challenges surrounding the development of highly scalable, fault tolerant, distributed system for solving complex data integration problems. You have innovative ideas around maintaining coherency, low latency and manageability in a large-scale distributed system containing 1000's of nodes. You relish interacting with other architects, senior developers as well as executives from across Informatica to evangelize Informatica Cloud platform and services. You enjoy the prospect of having a significant hand in making Informatica Cloud the industry dominant integration platform as a service.
Your Responsibilities
- Define the best architecture and choose the best technologies, components and subsystems for a multi-tenant and ISV enabled platform capable of supporting and scaling with growing number of components and users
- Drive innovations that improve availability, resiliency and performance of the service
- Define and develop robust monitoring, automatic metrics collection and automatic repair of these systems to handle failures gracefully
- Evaluate related technologies built at Informatica and determine which of them can be leveraged to deliver innovative new integration services on Informatica Cloud
- Serve as a thought leader and mentor on technical, architectural, design and related issues
- Proactively identify architectural weaknesses and recommending appropriate solutions
- Work closely with the rest of the technology leadership team, including development, quality assurance, and technical operations to optimize the deployment and upgrades of the service
Your Qualifications:
Professional Experience:
- 12+ years of relevant professional experience, a portion of which was within a global enterprise software company
- Demonstrated success in building enterprise-class SaaS applications, hosting on public cloud infrastructure, deployment automation, continuous integration, and test-driven development
- Experienced architect with cross-domain, cross-functional and cross-industry expertise with demonstrated knowledge and skills that are both broad and deep
- History of leading multiple concurrent projects and performing in a variety of different roles in the software development life cycle
- Willingness and demonstrated ability to be hands on and close to the technology
- Demonstrated ability to share and communicate ideas to executive staff, business sponsors, technical resources and other key constituents in clear, concise language
Technical skills:
- BS in Computer Science or related fields; advanced degree a plus
- 10+ years of professional software development experience
- Expert in running Cloud @scale on cloud hosting provider experience (AWS, Azure, Google Cloud)
- Strong understanding across Cloud and infrastructure components (server, storage, network, data, and applications) to deliver end to end Cloud Infrastructure architectures and designs.
- Proven ability to architect and builds high volume, mission critical micro-service \ cloud native applications.
- Experience with Kubernetes, EKS, AKS
- Experience with monitoring tools SumoLogic, AppDynamics, Prometheus
- Experience leveraging and writing architectural patterns, best practices and guidelines for enterprise applications.
- 5+ years' experience as a Cloud SRE Lead, with focus on distributed systems architecture & design.
- Understands and can articulate the technical merits and value of Cloud computing
- Highly driven and results orientated: Drives results through people, communication, influence and interaction.
- Excellent communication and interpersonal skills; executive presence; well-honed influencing and negotiating skills
- Able to work independently with little direct supervision; take initiative; willing to mentor and develop others
- Strong analytical problem solving and decision-making skills
- Ability to react quickly to changing requirements due to product limitations or driven by enterprise needs
#LI-ET1