Data Scientist (Vector DB Engineer – Data Scientist)

Develop and optimize vector database solutions supporting large-scale data retrieval
Bangalore
Mid-Level
Caterpillar

A leading manufacturer of construction and mining equipment, diesel and natural gas engines, industrial turbines, and diesel-electric locomotives.

Data Scientist (Vector DB Engineer – Data Scientist)

Career Area: Technology, Digital and Data

Job Description: Your work shapes the world at Caterpillar Inc.

When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.

Job Summary: We are seeking a skilled Data Scientist (Vector DB Engineer – Data Scientist) for Applications Development & Intelligence Automation - CAT IT Division.

This role is preferably based at the Whitefield PSN Office, Bangalore.

Job Roles and Responsibilities: As a Vector DB Engineer, you will be responsible for designing, implementing, and optimizing vector databases that enable high-performance, large-scale data processing and retrieval. You will work closely with our data science, machine learning, and software engineering teams to build robust solutions that support our clients' data-intensive applications.

Roles & Responsibilities:

  • Design, implement, and manage vector databases to support large-scale data storage and retrieval, ensuring low latency and high availability.
  • Develop efficient data models that facilitate fast vector operations such as similarity search, nearest neighbor search, and other vector-based queries.
  • Optimize database performance through indexing, partitioning, sharding, and other techniques to handle large-scale datasets.
  • Integrate vector databases with existing systems and applications, ensuring seamless data flow and accessibility.
  • Design and implement solutions that scale with growing data volumes, ensuring the database infrastructure can handle increased load and complexity.
  • Implement security best practices to protect data at rest and in transit, including encryption, access controls, and audit logging.
  • Monitor database performance and troubleshoot issues as they arise, ensuring system reliability and availability.
  • Work closely with data scientists, machine learning engineers, and software developers to understand their needs and provide database solutions that meet their requirements.
  • Maintain comprehensive documentation for database schemas, configurations, and procedures to support operational excellence and knowledge sharing.

What you will have:

  • Deep understanding and hands-on experience with vector databases, including their architecture, query languages, and optimization techniques.
  • Strong programming skills in languages such as Python, C++, or Java, with experience in developing and optimizing database operations.
  • Solid understanding of data structures, algorithms, and computational geometry, particularly related to vector search and similarity measures.
  • Experience with cloud platforms (e.g., AWS, GCP, Azure) and managed database services.
  • Understanding of machine learning concepts, particularly those related to embedding vectors and similarity searches.
  • Strong problem-solving skills with a focus on performance optimization and scalability.
  • Excellent communication skills, with the ability to articulate complex technical concepts to non-technical stakeholders.
  • A 5-year full-time education is required.
  • This position requires the candidate to work in the office five days a week.
  • Shift timing: 1:00 PM to 10:00 PM IST.

Other Preferred Skills:

  • Knowledge of general accounting practices, passion for financial reporting, ability to learn and adapt quickly, and a strong positive attitude.
  • Ability to maintain stable performance under demanding business needs and to support the business with appropriate urgency.

Skills Desired:

Business Statistics: Knowledge of the statistical tools, processes, and practices to describe business results in measurable scales; ability to use statistical tools and processes to assist in making business decisions.

Accuracy and Attention to Detail: Understanding the necessity and value of accuracy; ability to complete tasks with high levels of precision.

Analytical Thinking: Knowledge of techniques and tools that promote effective analysis; ability to determine the root cause of organizational problems and create alternative solutions that resolve these problems.

Machine Learning: Knowledge of principles, technologies and algorithms of machine learning; ability to develop, implement and deliver related systems, products and services.

Programming Languages: Knowledge of basic concepts and capabilities of programming; ability to use tools, techniques and platforms in order to write and modify code.

Query and Database Access Tools: Knowledge of data management systems; ability to use, support and access facilities for searching, extracting and formatting data for further use.

Requirements Analysis: Knowledge of tools, methods, and techniques of requirements analysis; ability to elicit, analyze and record functional and non-functional business requirements to ensure the success of a system or software development project.

What you will get:

  • Work Life Harmony
  • Earned and medical leave.
  • Relocation assistance

Holistic Development:

  • Personal and professional development through Caterpillar's employee resource groups across the globe
  • Career development opportunities with global prospects

Health and Wellness:

  • Medical coverage: medical, life and personal accident coverage
  • Employee mental wellness assistance program

Financial Wellness:

  • Employee investment plan
  • Pay for performance: annual incentive bonus plan

Additional Information: Caterpillar is not currently hiring individuals for this position who now or in the future require sponsorship for employment visa status; however, as a global company, Caterpillar offers many job opportunities outside of the U.S. which can be found through our employment website.

Caterpillar is an Equal Opportunity Employer (EEO/AA). All qualified individuals, including minorities, females, veterans and individuals with disabilities, are encouraged to apply.

Not ready to apply? Join our Talent Community.
