Responsible AI Engineer (On-Prem LLMs)

Texas Integrated Services
Houston, TX
Full time
$110,000 - $160,000
Posted 4 days ago

About the Role

Texas Integrated Services builds on-prem AI servers for small Texas businesses so they can run LLaMA, Mistral, and similar models on hardware they own. You'll deploy, tune, and support those systems end-to-end. You'll work across model quantization, vector databases (Qdrant), RAG pipelines, and hands-on server hardware setup. The mission is clear: keep customer data out of third-party APIs and in the customer's own infrastructure. Ideal candidate: comfortable in both the GPU stack and in a client's server closet, and principled about what data should and should not leave a customer's building.

Requirements

- Strong Linux server + GPU deployment experience - Experience running LLaMA or Mistral in production - Familiarity with Qdrant or similar vector DBs - RAG pipeline design and evaluation - Understanding of HIPAA and attorney-client privilege and why on-prem matters for them - Willingness to travel within Texas for install days

About Texas Integrated Services

Stop paying monthly fees to AI companies. Own your AI infrastructure.

Texas Integrated Services builds on-premise AI servers for small Texas businesses using open-source LLMs (LLaMA, Mistral), Qdrant vector databases, and RAG pipelines. The company eliminates recurring AI subscription costs by running everything on client-owned hardware.

Coalition Rank:
38.8
/ 100
Job ID: 43c9298f-953c-4018-b83f-b7b0a130ede1