All jobs
Trase SystemsData
Principal AI Researcher (Agentic Systems & AI Infrastructure)
United States$250,000-300,000Posted yesterday
Trase Systems is an AI company specializing in deploying, managing, and optimizing AI solutions for enterprise environments, with a focus on AI Agent innovation and autonomous systems in regulated industries.
Location: United States
Salary: $250,000-300,000
Responsibilities
- Define and evolve the long-term AI/ML research strategy and technical roadmap for Trase OS.
- Lead large-scale experimentation and prototyping efforts requiring significant compute infrastructure.
- Drive original research and technical breakthroughs in agentic systems, autonomous execution, multi-agent orchestration, post-training and fine-tuning systems, SLM/LLM-based architectures, and applied AI infrastructure.
- Design how models operate within long-lived execution environments, including agent workflows, tool use, planning, memory systems, reasoning, and human-in-the-loop controls.
- Establish evaluation methodologies and reliability frameworks for autonomous systems, including benchmarking, regression testing, safety, controllability, and production behavior analysis.
- Drive architecture decisions across orchestration, model serving, routing, inference, and infrastructure governance.
- Partner closely with engineering and product teams to operationalize research outcomes into deployable systems and enterprise workflows.
- Build AI systems that operate reliably in regulated and constrained environments, including secure cloud, on-premise, and air-gapped deployments.
- Contribute to the broader AI research community through technical papers, publications, conference participation, architecture proposals, and thought leadership.
- Serve as a senior technical authority and mentor across the organization.
Requirements
- 12–15+ years of experience in machine learning, AI systems, or applied AI research, including experience operating at a Principal, Distinguished, or equivalent technical level.
- Strong research and publication track record, including authored papers, major technical contributions, or active participation in frontier AI research.
- Experience publishing at top-tier conferences or contributing influential open-source, research, or AI infrastructure systems.
- Experience conducting large-scale experimentation requiring significant compute infrastructure, evaluation workflows, and iterative model/system analysis.
- Deep expertise in one or more areas including agentic systems, LLMs and generative AI, multi-agent systems, reasoning systems, reinforcement learning, orchestration infrastructure, AI systems reliability, NLP, multimodal systems, or deep learning.
- Hands-on experience with agent-based systems, prompt engineering, RAG, RLHF, SLMs, fine-tuning/post-training techniques, tool integration, memory systems, and human-in-the-loop orchestration.
- Proven experience building, deploying, and operating enterprise-grade AI systems, including GenAI, LLM, or agent-based applications at scale.
- Strong understanding of ML system behavior in production, including reliability, latency, cost tradeoffs, observability, evaluation frameworks, regression testing, and failure modes.
- Strong systems thinking and demonstrated ability to partner cross-functionally with engineering and product organizations to move research into production systems.
- Strong programming and prototyping skills in Python and modern ML infrastructure stacks, with experience in Java or related systems languages preferred.
- Experience deploying AI/ML systems in regulated, constrained, or enterprise environments, and demonstrated ability to lead technical direction from research through production impact.
Location
United States
Salary
$250,000-300,000
Category
DataCompany
Trase SystemsSource
himalayas
Posted
yesterday
Skills & Tags