Ai Inference Engineer Qvac (México)

Ai Inference Engineer Qvac (México)

06 may
|
Jobgether
|
México

06 may

Jobgether

México

This position is posted by Jobgether on behalf of a partner company.We are currently looking for an AI Inference Engineer QVAC in Mexico.This role offers a unique opportunity to work at the cutting edge of on-device AI, building the core systems that power fast, private, and reliable inference on real-world hardware.You will operate close to the metal, designing and optimizing the runtime layer that enables machine learning models to perform efficiently without relying on cloud infrastructure.The position sits at the intersection of systems engineering and AI, where performance, stability, and scalability are critical.You will collaborate with researchers and product teams to bring advanced models into production environments.With a strong focus on low-level optimization and architecture, your work will directly shape the future of decentralized, peer-to-peer AI experiences.This is an idóneo role for engineers who enjoy deep technical challenges and ownership of core infrastructure.AccountabilitiesDevelop and optimize C++-based inference systems for deploying AI models on edge devices.Enhance and adapt inference engines such as llama.cpp, ggml, and ONNX for improved performance and compatibility.Improve runtime efficiency, focusing on memory usage, latency, throughput, and long-session stability.Collaborate with research teams to transition models from experimentation to production-ready deployments.Define and maintain core abstractions that support scalable and maintainable inference capabilities.Integrate AI-driven features into existing products,



ensuring seamless performance and reliability.Continuously evaluate and implement new technologies to improve system capabilities and efficiency.RequirementsYou are a highly skilled engineer with a strong foundation in systems programming and machine learning, capable of working on complex, performance-critical AI infrastructure.Strong programming expertise in C++, with additional experience in JavaScript considered a plus.Proven experience with inference frameworks such as llama.cpp, ggml, ONNX, or similar technologies.Solid understanding of deep learning concepts, including transformers, LLMs, and diffusion models.Experience deploying and optimizing machine learning models on edge devices or constrained environments.Ability to quickly learn and apply new technologies in a fast-evolving AI landscape.Strong problem-solving skills with attention to performance, scalability, and reliability.Degree in Computer Science, AI, Machine Learning, or a related field, or equivalent practical experience.BenefitsFully remote, globally distributed work environmentOpportunity to work on cutting-edge AI and decentralized technologiesHigh ownership and impact on core product infrastructureCollaboration with top talent in AI, systems engineering, and fintechDynamic, fast-paced environment focused on innovation and experimentationExposure to advanced AI frameworks and next-generation product developmentCompetitive compensation aligned with experience and expertise#J-*****-Ljbffr

📌 Ai Inference Engineer Qvac (México)
🏢 Jobgether
📍 México

Postulate a este anuncio

Muestra tus habilidades a la empresa, rellenar el formulario y deja un toque personal en la carta, ayudará el reclutador en la elección del candidato.

Suscribete a esta alerta:
Escribe tu dirección de correo electrónico, te permitirá de estar al tanto de los últimos empleos por: ai inference engineer qvac (méxico) / méxico
Suscribete a esta alerta:
Escribe tu dirección de correo electrónico, te permitirá de estar al tanto de los últimos empleos por: ai inference engineer qvac (méxico) / méxico