Highlights
Hands-on experience, LLMs & RAG pipelines, Python programming, Exploratory Data Analysis (EDA), Collaborate with cross-functional teams
Description
Job Summary
pWe are seeking a passionate intern to join our team and contribute to the development of cutting-edge AI systems. This role offers hands-on experience with data science, large language models, and real-world applications. Ideal for those starting their career in data science or technology.
Responsibilities
- Assist in data collection, cleaning, and preprocessing
- Perform exploratory data analysis (EDA) and basic statistical analysis
- Build and evaluate baseline ML models (classification, regression, clustering)
- Document experiments, features, and results clearly
- Work with Large Language Models (LLMs) via APIs (OpenAI, Azure OpenAI, Hugging Face)
- Implement prompt engineering and template creation
- Aid in building RAG pipelines
- Write clean, readable Python code
- Collaborate with cross-functional teams
Required Skills
- Data Preprocessing
- Prompt Engineering
- RAG Pipelines
- Python Programming
- Exploratory Data Analysis (EDA)
Required Skills Explained
- Data Collection, Cleaning and Preprocessing: Understanding how to gather and prepare data for analysis is crucial for any data science project.
- Exploratory Data Analysis (EDA): Performing EDA helps in understanding the underlying structure of data and identifying patterns or outliers before building models.
- Basic Machine Learning Models: Building and evaluating simple machine learning models like classification, regression, and clustering gives hands-on experience with model training and validation.
- Prompt Engineering and Templates for LLMs: Knowledge on how to craft effective prompts for large language models is essential for generating accurate and useful outputs from these AI systems.
- Version Control using Git: Understanding the basics of version control is important for managing code changes and collaborating with other team members.
Who is this for
pThis role is perfect for individuals with a strong interest in generative AI and autonomous agents, specifically those beginning their data science career. Ideal candidates are curious, self-driven learners who enjoy experimenting and solving complex problems.
Why This Job is a Good Opportunity
ulliHands-On Experience in Real-World AI Systems: Gain practical experience working on live projects that involve real-world applications of AI, which can be beneficial for your professional portfolio.liPromotion to Future Roles: The role offers a pathway to more advanced positions as you develop your skills and grow within the company's data science team.liCompetitive Rewards & Benefits: Enjoy competitive salaries along with comprehensive employee benefits, including training and development opportunities that help in career advancement.
Interview Preparation Tips
- Prepare Examples of Your Projects: Be ready to discuss any personal projects or GitHub repositories you have worked on, especially those related to data science and AI.
- Showcase Your Python Skills: Since Python is a core skill for this role, be prepared to explain how you use it in your work and provide code snippets if possible.
- Practice with LLM APIs: Demonstrate your understanding of working with Large Language Models (LLMs) by showing examples of prompt engineering or any relevant experience using LLM APIs.
Career Growth in This Role
pStarting as an intern, you will have the opportunity to learn from experienced data scientists and engineers, which is crucial for building a strong foundation in data science. As your skills grow, there are opportunities to take on more complex projects, leading to advancements into roles such as Junior Data Scientist or Machine Learning Engineer.pThe role also offers insights into various aspects of AI technology and its application in different industries. This exposure can help you explore diverse career paths within the field, whether it's specializing in generative AI, working with LLMs, or developing data-driven decision-making tools.
Explore More Opportunities
Skills
Frequently Asked Questions
What is the duration of this internship?The internship is for a period of 6 months.
What are the expected qualifications?A B.Tech in CS with at least a CGPA of 8 and passing in 2024 or 2025.
Are there any specific tools I need to be proficient in?Familiarity with Python, SQL, pandas, NumPy, scikit-learn, matplotlib/seaborn is required.