AI Engineer

Description

We are an NVIDIA partner, delivering professional services in advanced software development, artificial intelligence systems, and high-performance AI solutions. We build production-grade AI systems optimized for NVIDIA GPU platforms, working across multiple applied AI domains, including Generative AI / Agentic Systems and Computer Vision.

Role Overview

We are looking for an experienced Senior AI Engineer to take a leading role in designing, building, and delivering advanced AI systems for real-world applications. This role is suited for a hands-on engineer with strong software engineering skills and deep experience in applied AI, who enjoys solving complex problems, owning technical solutions end-to-end, and mentoring junior team members. Projects typically focus on one primary domain—either LLM-based agentic systems or computer vision systems—depending on business needs and expertise.

What We Work On

Our work spans two main AI domains:

1. Agentic AI & Large Language Models

Design and implementation of agentic systems powered by Large Language Models (LLMs) Deployment of open-source LLMs on NVIDIA GPUs High-performance inference using NVIDIA technologies such as NVIDIA NIM (Inference Microservices) System-level design for reliability, scalability, and performance

2. Computer Vision & Deep Learning

Design and development of advanced computer vision systems CNN- and transformer-based models for real-world vision tasks Detection, segmentation, tracking, and video analytics pipelines Performance optimization on GPU platforms

Key Responsibilities

Lead the design and development of AI systems in either LLM-based agentic systems or computer vision
Own end-to-end delivery of AI solutions, from architecture and prototyping to production deployment
Optimize model performance, latency, and throughput on NVIDIA GPU platforms
Design clean, maintainable, and scalable AI software architectures
Collaborate with customers, product teams, and engineers to translate requirements into technical solutions
Mentor junior engineers and contribute to technical best practices
Evaluate new tools, models, and frameworks and drive their adoption when appropriate

Requirements

BSc or MSc in Computer Science, Electrical Engineering, or a related field

2+ years of experience in software engineering and applied AI (or equivalent)

Strong proficiency in Python and modern AI frameworks Proven experience delivering production-grade AI systems

Solid understanding of deep learning architectures (CNNs, transformers)

Experience with system-level design, debugging, and performance optimization

Domain-Specific Experience (One or More)

LLM / Agentic Systems:

Experience working with Large Language Models (LLMs) Building agentic workflows, reasoning systems, or AI-driven applications Deploying and optimizing open-source LLMs for inference

Computer Vision:

Strong background in computer vision and deep learning Hands-on experience with detection, segmentation, and tracking models

Experience with video pipelines and real-time or near-real-time systems

Nice to Have

Experience with NVIDIA technologies (CUDA, TensorRT, Triton, NVIDIA NIM)

Experience with GPU performance profiling and optimization Background in high-performance or low-latency systems

Experience mentoring engineers or leading technical initiatives

What We Offer Ownership of complex, high-impact AI projects Work with cutting-edge NVIDIA GPU and AI technologies Influence over architecture, tooling, and technical direction A collaborative, engineering-driven culture Opportunities for technical leadership and professional growth Real-world, production-scale AI challenges Full time Job

Location: Haifa, Hybrid

We at Deloitte believe that diversity and inclusion among our people is a critical component of our success and that is why we cultivate an organizational culture that contains and embraces diversity in all its forms.

Apply

Description

Requirements

Share this job