We are looking for highly motivated Data Scientist Interns to join our team and work on an exciting Retrieval-Augmented Generation (RAG) project. This is a great opportunity to gain hands-on experience with cutting-edge AI and NLP technologies while contributing to real-world applications.
Responsibilities: * Assist in the design, development, and evaluation of RAG-based models. * Preprocess, clean, and analyze large datasets for model training and evaluation. * Implement and optimize retrieval and generation components using state-of-the-art techniques. * Collaborate with the engineering and research teams to improve system performance. * Document experiments and present findings.
Requirements: * Strong foundation in Python and libraries such as NumPy, Pandas, and Scikit-learn. * Basic understanding of Machine Learning and Natural Language Processing concepts. * Familiarity with transformer-based models (e.g., BERT, GPT) is a plus. * Knowledge of vector databases (e.g., FAISS, Milvus) and LLMs is desirable. * Ability to learn quickly and work in a collaborative environment.
What We Offer: * Hands-on experience with real-world RAG systems and LLMs. * Mentorship from experienced data scientists and engineers. * Flexible working hours and remote-friendly environment. * Opportunity to transition into a full-time role based on performance.