Senior Data Scientist- RW
Designation: Senior Data Scientist
Location: Hyderabad, India
Work Mode: Office
Reporting to: Lead Data Scientist
About US:
Foundation AI automatically ingests incoming documents, emails, and attachments from across your firm. It profiles, matches, classifies, and saves each to your DMS and then automates document-dependent workflows according to your rules. Read more about us at www.foundationai.com
Job Overview:
As a Senior Data Scientist, you'll play a pivotal role in researching, developing, and enhancing algorithms that empower computers to learn from text. Your primary focus will be on improving accuracy, reducing costs, and enhancing efficiency through the development of cutting-edge NLP capabilities. By harnessing the power of structured and unstructured data, you'll contribute to the advancement of AI-driven solutions aimed at revolutionizing our client services.
Responsibilities:
- Use structured and unstructured data (text and images) to develop AI and ML models.
- Conduct research on emerging AI and ML techniques, focusing on NLP.
- Hands-on experience with advanced models such as Transformers (BERT, RoBERTa), CNN, LSTM, XGBoost, GPT, YOLO, etc.
- Experience in fine-tuning LLMs for tasks such as Classification, Information Extraction, and Document Summarization
- Optimize the model performance using techniques such as Quantization, Pruning, LoRA, PEFT, etc.
- Ensure data integrity through processing, cleansing, and verification.
- Apply pattern recognition, knowledge graph, classification, and regression techniques to solve complex problems.
- 3-6 years of hands-on experience in data science, with a strong understanding of Machine Learning principles.
- Familiarity with Deep Learning frameworks like TensorFlow and PyTorch.
- Implementing LLMs in Production at scale
- Strong prompt engineering and optimisation skills.
- Experience using LLM tools like LangChain, LlamaIndex, etc.
- Excellent programming skills in Python, with additional experience in R, Go, or Scala as a plus.
- Strong SQL skills for data manipulation.
- Experience building ML/DL models at scale and proficient in Git.
- Knowledge of Docker/Kubernetes, Web Application Development, PySpark, and Big Data technologies is advantageous.
- A proactive attitude and a passion for making a meaningful impact.
Skills and Tools:
- Minimum 3 years of experience in NLP, including text classification, information extraction, and named entity recognition.
- Proficiency with BERT family models (BERT, RoBERTa, DistilBERT).
- Experience consuming/integrating LLMs (Open Source/Commercial) such as Mistral, LLaMA, T5, Falcon, Vicuna, MPT, openAI, Claude, and BARD.
- Strong experience with TensorFlow and PyTorch.
- Expertise in Python programming.
- Working knowledge of PostgreSQL, Docker, and Git
- Demonstrated experience deploying NLP models in production environments.
- Experience working with AWS Cloud Services
- Experience working with LLM frameworks and Prompt engineering
Education:
Bachelors or Master’s degree in Computer Science, Electrical Engineering, Statistics, or related fields from Tier-1 colleges.
Our Commitment:
At Foundation AI, we're committed to creating an inclusive and diverse workplace. We value equal opportunity and affirmative action principles, giving everyone an equal chance to succeed. We're dedicated to offering equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status. Upholding these values and adhering to applicable laws is paramount to us.