Responsible AI Engineer (On-Prem LLMs)
Texas Integrated Services
Houston, TX
Full-time
$110,000 - $160,000
Posted 4 days ago
About the Role
Texas Integrated Services builds on-prem AI servers for small Texas businesses so they can run LLaMA, Mistral, and similar models on hardware they own. You'll deploy, tune, and support those systems end to end.
You'll work across model quantization, vector databases (Qdrant), RAG pipelines, and hands-on server hardware setup. The mission is clear: keep customer data out of third-party APIs and in the customer's own infrastructure.
Ideal candidate: comfortable both in the GPU stack and in a client's server closet, and principled about what data should and should not leave a customer's building.
Requirements
- Strong Linux server + GPU deployment experience
- Experience running LLaMA or Mistral in production
- Familiarity with Qdrant or similar vector DBs
- RAG pipeline design and evaluation
- Understanding of HIPAA and attorney-client privilege, and of why on-prem deployment matters for both
- Willingness to travel within Texas for install days
About Texas Integrated Services
Stop paying monthly fees to AI companies. Own your AI infrastructure.
Texas Integrated Services builds on-premises AI servers for small Texas businesses using open-source LLMs (LLaMA, Mistral), Qdrant vector databases, and RAG pipelines. The company eliminates recurring AI subscription costs by running everything on client-owned hardware.
Coalition Rank: 38.8 / 100
Job ID: 43c9298f-953c-4018-b83f-b7b0a130ede1