Remote AI Engineer - Innovate with LLMs
About the Role
Join us as a Remote AI Engineer at Fastino, where we are building the next generation of large language models (LLMs). Our team, made up of alumni from Google Research, Apple, Stanford, and Cambridge, is dedicated to developing specialized and efficient AI solutions. Fastino's GLiNER family of open-source models has been downloaded more than 5 million times and is used by industry leaders such as NVIDIA, Meta, and Airbnb. We raised $25 million in our seed round, with backing from investors including Microsoft and Khosla Ventures, and we are poised for significant growth.
What You’ll Work On
- Design and deploy high-performance agentic systems that leverage Fastino’s optimized model architectures to outperform traditional LLM benchmarks.
- Collaborate with engineering teams to bridge the gap between research and production, turning novel architectural breakthroughs into scalable, low-latency solutions for enterprise customers.
- Drive rapid, iterative prototyping of AI functionalities, refining model performance and task accuracy based on real-world telemetry to ensure specialized models meet rigorous developer standards.
- Own the stability and throughput of inference pipelines, proactively solving scalability bottlenecks to ensure models deliver consistent, reliable performance under massive operational loads.
- Architect large-scale data and fine-tuning strategies to continuously improve the precision and domain-specific reliability of Fastino models.
Requirements
- 2+ years of hands-on experience in AI/ML engineering roles.
- Demonstrated proficiency with LLMs and a proven track record of applying AI/ML techniques to solve complex, unstructured problems.
- Comfort working across the stack, from prompt engineering and vector DB tuning to Kubernetes deployment and API design.
Nice to Have
- Experience building microservices that handle high-concurrency agentic workloads.
- Familiarity with GLiNER or other information extraction architectures.
What We Offer
- Competitive salary ranging from $120,000 to $150,000 annually.
- Fully remote work environment, allowing you to work from anywhere in the world.
- Opportunity to work with a talented team of professionals from top tech companies.
- Access to cutting-edge technology and resources to enhance your skills.
- Flexible working hours to promote work-life balance.
This Remote AI Engineer position at Fastino offers a unique opportunity to work on cutting-edge LLMs with a talented team. Enjoy a competitive salary and the flexibility of remote work.
Who Will Succeed Here
- Proficiency in implementing and fine-tuning large language models (LLMs) using frameworks such as TensorFlow or PyTorch, with a strong understanding of transformer architectures.
- Experience in deploying machine learning models in a Kubernetes environment, ensuring scalability and reliability of AI solutions in a microservices architecture.
- A problem-solving mindset with a focus on innovative AI solutions, demonstrated through previous projects involving API design and integration with vector databases for optimized data retrieval.