Job Description

Overview 

 

We are seeking a skilled and experienced Site Reliability Engineer to join our team. In this role, you will be part of the AI & Cloud Engineering (ACE) Division and AI Workbench team. Our AI Workbench is a cloud-based environment to accelerate Automotive AI Software Development and Evaluation. The AI Workbench has 4 main functional blocks today, with one of those blocks providing access to both SILS (Software in the Loop Simulator) and HILS (Hardware in the Loop Simulator). 

   

As a Site Reliability Engineer - ACE, you will be responsible for designing, building, and  maintaining our infrastructure. You should have a strong background in cloud technologies and excellent problem-solving skills. You will work closely with multiple engineering teams (and cross-function teams) to support their infrastructure requirements.  

 

Our division’s mission is to use the latest AI and cloud technologies to develop the best AI inference for advanced driver safety engineers building self-driving vehicles and other high performance compute products. Renesas is the leading automotive electronics supplier globally, and this is a rare opportunity to develop the infrastructure required to deploy our AI software to the billions of devices we ship to customers every year. You will join our newly formed AI & Cloud Engineering organization of around 100 software engineers. Due to strong demand for our AI-related products we are planning to triple in size in the next three years, so there is lots of room for you to help us grow the team together while remaining small. Our team’s key locations are Tokyo, London, Paris, Dusseldorf, Beijing, Singapore, Ho Chi Minh City and other metropolitan areas, but you can also join fully remotely from other locations globally or get our support to relocate to our key hubs such as Tokyo. 

Responsibilities   

  • Design, build, and maintain our division’s infrastructure that supports our application development, with a focus on reliability, scalability, and performance.  

  •  Implement and automate deployment, monitoring, and scaling processes to ensure the smooth operation of our systems and services.  

  • Monitor system performance and reliability metrics, troubleshoot issues, and implement solutions to prevent downtime and improve efficiency.  

  • Collaborate with our teams Engineers to design, develop, and deploy reliable and scalable applications. 

  • Develop and maintain tools and scripts for automation, configuration management, and monitoring of our infrastructure and applications. 

  • Respond to incidents and emergencies to minimize downtime and ensure reliability of or systems.  

  • Continuously evaluate and improve our infrastructure, processes, and practises to enhance reliability, scalability, and efficiency.  

  • Stay up-to-date with industry trends, best practises, and emerging technologies in site reliability engineering and cloud computing. 

 

Type:
Permanent
Contract Length:
N/A
Job Reference:
406000228478638
Job ID:
1258000000000270229

Remember: You should never send cash or cheques to a prospective employer, or provide any financial information. Please get in touch if you see any roles asking for payments or financial details from you. For more information, visit jobsaware.co.uk.

Create new Job Alert

Create a new Job Alert to make sure you see the best new jobs first!

Your search has been saved and has been added to your Job Alerts