GenAI Vector DB Engineer
Futran Tech Solutions Pvt. Ltd.·US
Posted 2928w ago
Full-Time
Apply Now About the Role
GenAI VectorDB Engineer
Remote
Fulltime
Candidates may need to travel occasionally at client site.
Responsibilities:
The ideal candidate will have expertise in designing, implementing, and optimizing vector databases, with a strong focus on utilizing ChoromDB/Pinecone for vector applications. The role involves contributing to the development and maintenance of our data infrastructure, ensuring efficient handling of complex relationships and vectors.
• Design and implement vector databases to efficiently store and retrieve high-dimensional vectors.
• Optimize database queries, indexing strategies for vector operations.
• Architect and performance tune vector pipeline for embedding and text similarity search
• Identify and resolve performance bottlenecks to ensure efficient data retrieval.
• Collaborate with application developers to integrate vector databases and knowledge graphs into various software solutions.
• Provide support for query optimization and data modeling for application-specific requirements.
• Implement and maintain data security measures for vector databases.
• Ensure compliance with relevant data protection regulations and industry standards.
• Work closely with cross-functional teams, including data scientists, software engineers, and product managers.
• Communicate technical concepts and solutions effectively to both technical and non-technical stakeholders.
Technical skills:
• Knowledge of distributed database systems.
• Familiarity with machine learning and AI concepts related to vector data.
• Experience with cloud-based database solutions.
• Proven experience in designing and implementing vector databases, with a focus on ChromaDB/Pinecone etc for vector applications.
• Strong proficiency in embeddings, vectorization, vector stores, database optimization, performance tuning, and relevant query languages.
• Familiarity with embedding, retrieval algorithms, agents, data modeling for vector development graphs.
• Experience with LLM and other related frameworks like Langchain, LLama
• Experience with relevant programming languages, such as Python, Java, or Scala.
• Excellent problem-solving skills and the ability to work in a collaborative team environment.
Soft skills:
• Strong work ethic and desire to produce quality results
• Consistently and proactively communicates (verbally/written) to stakeholders (progress/roadblocks/etc.)
• Continuous Improvement mindset and approach to work product
• Ability to take complex subjects and simplify it to less technical individuals
• Provides clear documentation of processes, workflows, recommendations, etc.
• High level of critical thinking capabilities
• Organized and has the ability to manage work effectively, escalating issues as appropriate
• Takes initiative & is a self-starter
• Displays ownership of their work (quality, timeliness)
• Seeks to become an expert in their field and shares their expertise through recommendations, proactive communications/actions and peer sharing/coaching where relevant
• Should be able to communicate with stakeholders directly and independently
• Should have good problem solving skills
Candidate Profile:
• Bachelor's/Master's degree in economics, mathematics, computer science/engineering, operations research or related analytics areas; candidates with BA/BS degrees in the same fields from the top tier academic institutions are also welcome to apply
• 8+ years of experience working with data engineering with atleast 2-3 years of experience working on vector databases.
• Strong experience with vector databases including ChromaDB, Pinecone and corresponding implementations over cloud platforms (AWS, Azure etc.)
• Outstanding written and verbal communication skills
• Superior analytical and problem solving skills
• Experience in working in dual shore engagement is preferred
• Must have experience in managing clients directly
• Strong record of achievement, solid analytical ability, and an entrepreneurial hands-on approach to work
• Able to work in fast pace continuously evolving environment and ready to take up uphill challenges
• Is able to understand cross cultural differences and can work with clients across the globe
What you'll do
- Candidates may need to travel occasionally at client site
- The ideal candidate will have expertise in designing, implementing, and optimizing vector databases, with a strong focus on utilizing ChoromDB/Pinecone for vector applications
- The role involves contributing to the development and maintenance of our data infrastructure, ensuring efficient handling of complex relationships and vectors
- Design and implement vector databases to efficiently store and retrieve high-dimensional vectors
- Optimize database queries, indexing strategies for vector operations
- Architect and performance tune vector pipeline for embedding and text similarity search
- Identify and resolve performance bottlenecks to ensure efficient data retrieval
- Collaborate with application developers to integrate vector databases and knowledge graphs into various software solutions
- Provide support for query optimization and data modeling for application-specific requirements
- Implement and maintain data security measures for vector databases
- Ensure compliance with relevant data protection regulations and industry standards
- Work closely with cross-functional teams, including data scientists, software engineers, and product managers
- Communicate technical concepts and solutions effectively to both technical and non-technical stakeholders
- Displays ownership of their work (quality, timeliness)
Requirements
- Knowledge of distributed database systems
- Familiarity with machine learning and AI concepts related to vector data
- Experience with cloud-based database solutions
- Proven experience in designing and implementing vector databases, with a focus on ChromaDB/Pinecone etc for vector applications
- Strong proficiency in embeddings, vectorization, vector stores, database optimization, performance tuning, and relevant query languages
- Familiarity with embedding, retrieval algorithms, agents, data modeling for vector development graphs
- Experience with LLM and other related frameworks like Langchain, LLama
- Experience with relevant programming languages, such as Python, Java, or Scala
- Excellent problem-solving skills and the ability to work in a collaborative team environment
- Strong work ethic and desire to produce quality results
- Consistently and proactively communicates (verbally/written) to stakeholders (progress/roadblocks/etc.)
- Continuous Improvement mindset and approach to work product
- Ability to take complex subjects and simplify it to less technical individuals
- Provides clear documentation of processes, workflows, recommendations, etc
- High level of critical thinking capabilities
- Organized and has the ability to manage work effectively, escalating issues as appropriate
- Takes initiative & is a self-starter
- Seeks to become an expert in their field and shares their expertise through recommendations, proactive communications/actions and peer sharing/coaching where relevant
- Should be able to communicate with stakeholders directly and independently
- Should have good problem solving skills
- Bachelor's/Master's degree in economics, mathematics, computer science/engineering, operations research or related analytics areas; candidates with BA/BS degrees in the same fields from the top tier academic institutions are also welcome to apply
- 8+ years of experience working with data engineering with atleast 2-3 years of experience working on vector databases
- Strong experience with vector databases including ChromaDB, Pinecone and corresponding implementations over cloud platforms (AWS, Azure etc.)
- Outstanding written and verbal communication skills
- Superior analytical and problem solving skills
- Must have experience in managing clients directly
- Strong record of achievement, solid analytical ability, and an entrepreneurial hands-on approach to work
- Able to work in fast pace continuously evolving environment and ready to take up uphill challenges
- Is able to understand cross cultural differences and can work with clients across the globe