Edge AI Machine Learning Engineer

Computaris
Oraș: Chișinău
Studii: Universitate
Experiența de munca: Peste 5 ani
Salariu: Nespecificat
Program de muncă: Full-time
R Systems is seeking a talented Edge AI Machine Learning Engineer with specialized expertise in embedded GPU/NPU acceleration to join our team.

The ideal candidate will have extensive hands-on experience in developing and optimizing AI inference models for embedded GPU/NPU architectures. As a Principal Machine Learning Engineer specializing in Edge AI, you will play a crucial role in shaping the future Edge AI solution, leveraging the power of GPU/NPU acceleration and enterprise-grade, large-scale edge computing. You will combine technical excellence with effective leadership, creating a positive impact on both projects and team dynamics.
If you are a skilled Edge AI Machine Learning Engineer with a passion for pushing the boundaries of edge computing and GPU/NPU acceleration, we want to hear from you!
Apply now to be part of our dynamic and collaborative team and join us in shaping the future of AI at the edge and revolutionizing industries with innovative Edge AI solutions!
Your contribution:

Influence the Edge AI strategy by providing expert advice on design and architecture.
Make critical decisions regarding technical directions, scalability, and system performance.
Develop and optimize AI inference models for deployment on edge devices with embedded GPU/TPU accelerators, focusing on low-latency inference runtimes.
Implement and fine-tune low-latency inference pipelines to meet real-time performance requirements.
Collaborate with cross-functional teams to integrate AI inference solutions into edge computing platforms and applications.
Collaborate with the GPU Hardware Design Team to design and optimize GPUs/NPUs that power next-generation devices.
Conduct performance profiling and optimization to maximize the efficiency of GPU/NPU acceleration for Edge AI inference.
Work on micro-architecture development, ensuring efficient execution of graphics, compute, and AI workloads within energy and area constraints.
Stay current with advancements in GPU, NPU, and Edge AI frameworks, incorporating them into solution designs as appropriate.
Provide technical expertise and support to project teams, ensuring successful implementation and deployment of Edge AI solutions.

Bring along:

Bachelor’s degree in computer science, Engineering, or a related field; Master’s degree preferred.
5+ years of hands-on experience in AI model development and deployment, with a focus on edge computing and inference runtime optimization.
Strong programming skills in languages such as Python and C/C++
Proficiency in ML frameworks (e.g., Scikit-learn, TensorFlow, PyTorch, XGBoost) with focus on edge AI applications deployment (e.g. Glow, TFLite, TensorRT)
Experience with MLOps frameworks (e.g., Kubeflow, MLflow, TFX, Airflow, H2O)
Extensive experience with GPU/TPU acceleration for AI inference, including optimization techniques (tensor, pipeline, data, sharded data parallelism) and performance tuning
Hands on experience with one or more GPU/NPU frameworks: CUDA, Vulkan, OpenCL, familiarity with NVIDIA Jetson, ARM Mali, or relevant SoC configurations
Knowledge of parallel computation, memory scheduling, and structural optimization
Excellent problem-solving and analytical skills, with a passion for innovation and continuous learning

Additional Skills (Preferred):

Experience with edge device hardware and software integration.
Familiarity with edge computing architectures and IoT platforms.
Experience with edge AI applications in domains such as robotics, autonomous vehicles, or industrial automation
Email: recruitmentmd@rsystems.com
Atașează CV-ul CV de pe site
Atașează CV-ul

Indicați Nume și Prenume

Ai un CV pe site? Intră
Trimite CV-ul

Alegeți jobul

Adaugă o scrisoare de intenție
Închide
Felicitări,
CV-ul tău a fost trimis cu succes! 🥳
Închide