Applied AI Data Engineer (Vector Databases, Data Management)
Gainsight
This job is no longer accepting applications
Job Description:
Why Gainsight?
We are ranked #1 on Glassdoor’s 2023 Best Place to Work List. Here’s why.
At Gainsight, our mission is to be living proof you can win in business while being human-first.
Our industry-leading platform helps companies of all sizes and industries build durable businesses. Gainsight offers a powerful set of customer success, product, community and education solutions that enable businesses to scale efficiently, create alignment, and have a holistic view of their customers—all of which help increase product adoption, prevent churn, and grow renewals and expansions. Hundreds of companies use our software, including nearly 200 publicly traded organizations and industry leaders such as GE Digital, SAP Concur, and Zendesk. We have offices in the US, UK, Netherlands, Israel, Japan, and India.
Gainsight joined the Vista Equity Partners portfolio in 2020. In 2021, we won their Excellence in Engineering award for our product and engineering advancements.
Gainsight has also been named one of the top 100 private cloud companies by Forbes, one of the fastest-growing private companies in America by Inc. Magazine, and one of 20 Great Workplaces in Tech by Fortune Magazine.
With diversity and inclusion at the forefront of our values, we promote a culture that celebrates diversity and inclusiveness regardless of factors including, but not limited to, race, gender, sexual orientation, family status, religion, ethnicity, national origin, physical disability, veteran status, or age.
Job Summary:
Applied AI Data Engineer (Vector Databases, Data Management)
As an Applied AI Data Engineer, you will be responsible for building data pipelines, vector embeddings, and retrieval mechanisms that power AI reasoning systems. Your work ensures that LLMs remain grounded in fact, efficiently retrieving high-quality, contextually relevant data without noise or hallucinations.
You will design and implement features that harness vector search, retrieval-augmented generation (RAG), and domain-specific embeddings, directly influencing how AI models store, retrieve, and apply knowledge at scale.
What You’ll Do:
Build and optimize data pipelines that transform incoming documents into high-quality embeddings for AI retrieval.
Design and implement vector search strategies using Pinecone, Weaviate, FAISS, or Vespa to improve AI response relevance.
Develop retrieval-augmented generation (RAG) workflows, ensuring models access up-to-date and high-quality context.
Fine-tune chunking strategies and indexing frequencies to enhance information recall and factual accuracy.
Integrate hybrid search approaches (semantic + keyword) to improve precision and efficiency in knowledge retrieval.
Monitor retrieval logs and LLM interaction patterns, adjusting embedding configurations for maximum relevance.
Compare model performance (GPT-4, Claude, Llama 2) across different embedding structures and refine tuning strategies.
Experiment with metadata filtering techniques to dynamically surface the most relevant data for AI reasoning agents.
Collaborate with ML engineers and AI researchers to ensure data pipelines align with evolving AI capabilities.
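For a concrete sense of the responsibilities above, here is a minimal, illustrative sketch of chunking, hybrid (semantic + keyword) scoring, and metadata filtering. It is an assumption-laden toy, not Gainsight's implementation: every function name is hypothetical, `embed` is a bag-of-words stand-in for a real embedding model, and the in-memory list stands in for a vector database such as those named above.

```python
import math
import re
from collections import Counter


def chunk_text(text, max_words=8, overlap=2):
    """Split text into overlapping word windows (one simple chunking strategy)."""
    if overlap >= max_words:
        raise ValueError("overlap must be smaller than max_words")
    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):
            break  # the last window already reached the end of the text
    return chunks


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def embed(text):
    """Toy stand-in for an embedding model: a sparse term-count vector."""
    return Counter(tokenize(text))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def keyword_score(query, text):
    """Fraction of distinct query terms that appear in the text."""
    q = set(tokenize(query))
    return len(q & set(tokenize(text))) / len(q) if q else 0.0


def build_index(docs):
    """Chunk each document and store text, vector, and metadata per chunk."""
    return [
        {"text": chunk, "vector": embed(chunk), "meta": doc["meta"]}
        for doc in docs
        for chunk in chunk_text(doc["text"])
    ]


def hybrid_search(query, records, alpha=0.5, filters=None, top_k=3):
    """Rank by alpha * semantic + (1 - alpha) * keyword, after metadata filtering."""
    filters = filters or {}
    q_vec = embed(query)
    scored = []
    for rec in records:
        if any(rec["meta"].get(k) != v for k, v in filters.items()):
            continue  # drop records whose metadata does not match the filter
        score = alpha * cosine(q_vec, rec["vector"]) + (1 - alpha) * keyword_score(query, rec["text"])
        scored.append((score, rec))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [rec for _, rec in scored[:top_k]]


# Hypothetical sample documents, invented for illustration only.
docs = [
    {"text": "Renewal risk rises when product adoption drops", "meta": {"source": "playbook"}},
    {"text": "Office lunch menu for Friday", "meta": {"source": "wiki"}},
]
index = build_index(docs)
results = hybrid_search("product adoption", index, filters={"source": "playbook"})
```

In this sketch, `alpha` controls the balance between the semantic and keyword scores, and `filters` narrows the candidate set before ranking. A production system would typically replace `embed` with a learned embedding model and the list scan with an approximate nearest-neighbor index.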
Who You Are (Experience & Qualifications)
5–8+ years of experience in Data Engineering, AI Systems, or Machine Learning Infrastructure.
3+ years of hands-on experience working with vector databases, embeddings, and retrieval-augmented generation (RAG).
Strong understanding of vector search algorithms, indexing strategies, and hybrid search techniques.
Expertise in building and scaling data pipelines for AI-driven applications.
Proficiency in Python, along with experience using libraries such as Hugging Face, LangChain, and OpenAI SDKs.
Hands-on experience with vector database platforms (Pinecone, Weaviate, FAISS, ChromaDB, or Vespa).
Deep knowledge of LLM retrieval strategies, chunking methodologies, and context optimization.
Familiarity with semantic search, keyword search, and metadata filtering techniques.
Strong grasp of data governance, security, and optimization for AI-driven knowledge retrieval.
Experience integrating retrieval mechanisms with multi-agent AI systems.
Bonus Skills (Nice to Have):
Experience in fine-tuning transformer models for domain-specific retrieval tasks.
Familiarity with real-time indexing and adaptive embedding refresh strategies.
Understanding of LLM hallucination mitigation and factual consistency techniques.
Experience building scalable knowledge graphs and structured AI databases.
Background in AI-powered document processing and knowledge extraction.
Job Benefits
At Gainsight, our mission is to be living proof you can win in business while being human-first.
Your job should never be a barrier to your happiness—it should be an avenue to achieve it. At Gainsight, we’re passionate about achieving our goals—at the office and everywhere—and we work every day to create an environment that nurtures our best selves.
Here are our 5 core values
● Golden Rule: We try to practice the Golden Rule by exercising reliability, trust and giving back to each other and our community.
● Success for All: We believe that success for our stakeholders (whether teammates, clients, or shareholders) comes with a sincere focus on continuous learning, selfless teaching, and making a difference in each other’s lives.
● Child-like Joy: We aspire to experience child-like joy in our work and lives, injecting a spirit of passion, optimism and laughter into everything that we do.
● Shoshin: We believe in a beginner’s mind. Don’t surround yourself with people like you; diversity breeds creativity.
● Stay Thirsty, My Friends: We believe in a totally internally driven pursuit of greatness. The solution is to think more, not do more.
Why You’ll Love It Here
● Our Attitude: We’ve created a new category from scratch and we continue to be the thought leader in Customer Success.
● Our Leadership: We offer the leading tech solution for driving Customer Success.
● Our ROI: Reduce customer churn, increase up-sell, and improve customer satisfaction.
● Our Technology: Our technology allows companies to drive retention and growth by delivering the value customers demand.
● Our Impact: In addition to helping companies grow, we’ve committed to $100 million in wage expansion for underrepresented groups over the next few years.
● Our Clients: Big companies like Box, Adobe, Marketo, and many others.
● Our Team: Our team is composed of innovative Customer Success thought leaders and experts in their field from various industries.
Benefits include medical, dental, vision, short- and long-term disability, life insurance, a 401(k) available on the first day of the month after the start date, and flexible PTO.
Gainsight is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Pursuant to the San Francisco Fair Chance Ordinance, where applicable, we will consider for employment qualified applicants with arrest and conviction records.