Software Engineer, AI/ML GenAI job opportunity at Instrumentl.



Date2026-01-09 bot
Instrumentl Software Engineer, AI/ML GenAI
Experience: 5-years
Pattern: Remote
apply Apply Now
Salary:
Status:

AI/ML GenAI

Copy Link Report
degreeGeneral
loacation Remote, Remote
loacation Remote....Remote
Auto GPT Summarize Enabled

<p><span style="font-size: 12pt;">👋</span><strong style="font-size: 12pt;">Hello, we’re Instrumentl.&nbsp;</strong><span style="font-size: 12pt;">We’re a mission-driven startup helping the nonprofit sector to drive impact, and we’re well on our way to becoming the #1 most-loved grant discovery and management tool.&nbsp;</span></p> <p>&nbsp;</p> <p><strong style="font-size: 12pt;">About us: </strong><span style="font-size: 12pt;">Instrumentl is a hyper growth YC-backed startup with over 4,000 nonprofit clients, from local homeless shelters to larger organizations like the San Diego Zoo and the University of Alaska. We are building the future of fundraising automation, helping nonprofits to discover, track, and manage grants efficiently through our SaaS platform. Our charts are dramatically up-and-to-the-right 📈 — we’re cash flow positive and doubling year-over-year, with customers who love us (NPS is 65+ and Ellis PMF survey is 60+). Join us on this rocket ship to Mars!</span></p> <p>&nbsp;</p> <p><strong style="font-size: 12pt;">About the Role : </strong><span style="font-size: 12pt;">As a </span><strong style="font-size: 12pt;">Software Engineer, AI/ML GenAI</strong><span style="font-size: 12pt;"> at Instrumentl, you’ll own the full lifecycle of AI features—from </span><strong style="font-size: 12pt;">rapid prototyping</strong><span style="font-size: 12pt;"> to </span><strong style="font-size: 12pt;">production deployment and ongoing evaluation</strong><span style="font-size: 12pt;">. You will build </span><strong style="font-size: 12pt;">agentic LLM systems</strong><span style="font-size: 12pt;"> that can plan and use tools, implement </span><strong style="font-size: 12pt;">RAG pipelines</strong><span style="font-size: 12pt;"> over our domain data, manage and evolve </span><strong style="font-size: 12pt;">embeddings</strong><span style="font-size: 12pt;">, and stand up&nbsp;</span><strong style="font-size: 12pt;">evaluation/observability</strong><span style="font-size: 12pt;"> so our AI is grounded, safe, and cost‑effective. You’ll embed with one of the product&nbsp;pods in a hands-on role, collaborating closely with Product and Design, while partnering with DTI on platform‑level AI capabilities. </span></p> <p>&nbsp;</p> <p><span style="font-size: 12pt;">The Instrumentl team is fully distributed (though if you’d like to work from our Oakland office, we would love to see you there). For this position, we are looking for someone who has overlap with Pacific Time Zone working hours.</span></p>\n<p></p><p><br></p><b>What you will do</b><ul> <li><strong>Design agentic systems &amp; ship AI to production: </strong>Build resilient, observable services, while optimizing cost and latency budgets. Build tool‑using LLM “agents” (task planning, function/tool calling, multi‑step workflows, guardrails) for tasks like grant discovery, application drafting, document parsing and many more.</li> <li><strong>Own RAG end‑to‑end: </strong>Ingest and normalize content, choose chunking/embedding strategies, implement hybrid retrieval, re‑ranking, citations, and grounding. Continuously improve recall/precision.</li> <li><strong>Manage embeddings at scale: </strong>Select, evaluate, and migrate embedding models; maintain vector stores (e.g., pgvector/Qdrant/Pinecone etc.); monitor drift and rebuild strategies.</li> <li><strong>Collaborate cross‑functionally while raising engineering standards:&nbsp;</strong>Work side by side with Product, Design on scoping, UX, and measurement; run experiments (A/B, canaries), interpret results, and iterate. Write clear, maintainable code, add tests and docs, and contribute to reliability practices (alerts, dashboards, incident response).</li> </ul><p><br></p><b>What we're looking for </b><ul> <li><strong>Software engineering background: </strong>5+ years of professional software engineering experience (as an IC), including 2+ years working with modern LLMs.</li> <li><strong>Proven production impact: </strong>You’ve taken LLM/RAG systems from prototype to production, owned reliability/observability, and iterated post‑launch based on evals and user feedback.</li> <li><strong>LLM agentic systems: </strong>Experience building tool/function‑calling workflows, planning/execution loops, and safe tool integrations (e.g., with LangChain/LangGraph, LlamaIndex, Semantic Kernel, or custom orchestration).</li> <li><strong>RAG expertise: </strong>Strong grasp of document ingestion, chunking/windowing, embeddings, hybrid search (keyword + vector), re‑ranking, and grounded citations. Experience with re‑rankers/cross‑encoders, hybrid retrieval tuning, or search/recommendation systems.</li> <li><strong>Embeddings &amp; vector stores:</strong> Hands‑on with embedding model selection/versioning and vector DBs (e.g., pgvector, Qdrant, Pinecone, Weaviate, Milvus etc.).</li> <li><strong>Evaluation mindset: </strong>Comfort designing eval suites (RAG/QA, extraction, summarization), using automated and human‑in‑the‑loop methods; familiarity with frameworks like Ragas/DeepEval/OpenAI Evals or equivalent.</li> <li><strong>Infrastructure &amp; languages: </strong>Proficiency in Python (FastAPI, Celery); Experience with GCP/AWS, Docker, CI/CD, and observability (logs/metrics/traces).</li> <li><strong>Data chops: </strong>Comfortable with SQL, schema design, and building/maintaining data pipelines that power retrieval and evaluation.</li> <li><strong>Collaborative approach: </strong>You thrive in a cross‑functional environment and can translate research ideas into shippable, user‑friendly features.</li> <li><strong>Results‑driven: </strong>Bias for action and ownership with an eye for speed, quality, and simplicity.</li> </ul><p><br></p><b>Nice to have </b><ul> <li><strong>Startup Experience </strong>and&nbsp;comfort operating in fast, scrappy environments is a plus.</li> <li><strong>Familiarity with responsible AI,</strong> red‑teaming, and domain‑specific safety policies.</li> <li><strong>Fine‑tuning:&nbsp;</strong>Practical experience with SFT/LoRA or instruction‑tuning (and good intuition for when fine‑tuning vs. prompting vs. model choice is the right lever).</li> </ul> <div>&nbsp;</div><p><br></p><b>Compensation &amp; Benefits </b><ul> <li>Salary ranges are based on market data, relative to our size, industry, and stage of growth. Salary is one part of total compensation, which also includes equity, perks, and competitive benefits.&nbsp;</li> <li>For US-based candidates, our target salary band is&nbsp;<strong>$175,000 - $220,000/year&nbsp;</strong>+ equity. Salary decisions will be based on multiple factors including geographic location, qualifications for the role, skillset, proficiency, and experience level.&nbsp;</li> <li>100% covered health, dental, and vision insurance for employees, 50% for dependents.</li> <li>Generous PTO policy, including parental leave.</li> <li>401(k).</li> <li>Company laptop + stipend to set up your home workstation.</li> <li>Company retreats for in-person time with your colleagues.</li> <li>Work with awesome nonprofits around the US. We partner with incredible organizations doing meaningful work, and you get to help power their success.</li> </ul><p><br></p><p></p>\n

Other Ai Matches

remote-jobserver Remote
Inbound Sales Development Representative - Entry Level Applicants are expected to have a solid experience in handling Job related tasks
remote-jobserver Remote
Sales Manager - Mid-Market and Enterprise Applicants are expected to have a solid experience in handling Job related tasks
remote-jobserver Remote
Senior Data Engineer Applicants are expected to have a solid experience in handling Job related tasks
remote-jobserver Remote
Account Manager Applicants are expected to have a solid experience in handling Job related tasks