Senior Data Scientist
About US:
Foundation AI automatically ingests incoming documents, emails, and attachments from across your firm. It profiles matches, classifies, and saves each to your DMS, and then automates document-dependent workflows according to your rules. Read more about us at www.foundationai.com
Job Overview:
As a Senior Data Scientist, you'll play a pivotal role in researching, developing, and enhancing algorithms that empower computers to learn from text. Your primary focus will be on improving accuracy, reducing costs, and enhancing efficiency through the development of cutting-edge NLP capabilities. By harnessing the power of structured and unstructured data, you'll contribute to the advancement of AI-driven solutions aimed at revolutionizing our client services.
Responsibilities:
- Utilize both structured and unstructured data (text, images, video, audio) to develop AI and ML models.
- Conduct research on emerging AI and ML techniques, with a focus on NLP.
- Hands-on experience with advanced models such as Transformers (BERT, RoBERTa), CNN, LSTM, XGBoost, GPT, YOLO, etc.
- Experience in fine-tuning LLMs for tasks such as Information Extraction and Document Summarization
- Optimize the model performance using techniques such as Quantization, Pruning, LoRA
- Ensure data integrity through processing, cleansing, and verification.
- Apply pattern recognition, knowledge graph, classification, and regression techniques to solve complex problems.
- 3-6 years of hands-on experience in data science, with a strong understanding of Machine Learning principles.
- Proficiency in NLP and experience with BERT family models.
- Familiarity with Deep Learning frameworks like TensorFlow and PyTorch.
- Building and deploying LLMs in Production at scale
- Excellent programming skills in Python, with additional experience in Java and Scala as a plus.
- Strong SQL skills for data manipulation.
- Experience building ML/DL models at scale and proficient in Git.
- Knowledge of Docker/Kubernetes, Web Application Development, PySpark, and Big Data technologies is advantageous.
- A proactive attitude and a passion for making a meaningful impact.
- Minimum 2 years of experience in NLP, including text classification, information extraction, and named entity recognition.
- Proficiency with BERT family models (BERT, RoBERTa, DistilBERT).
- Experience with building LLMs (Open Source/Commercial), such as Mistral, LLaMA, T5, Falcon, Vicuna, MPT, openAI, Claude, BARD.
- Optimally deploying Open Source LLMs in Production
- Strong experience with TensorFlow and PyTorch.
- Expertise in Python programming.
- Working knowledge of PostgreSQL, Docker, Git, MLOps, and building pipelines in model monitoring.
- Demonstrated experience deploying NLP models in production environments.
Certifications:
Education:
- Bachelor’s or master's degree in computer science, Electrical Engineering, Statistics, or related fields from Tier-1 colleges.
Our Commitment:
At Foundation AI, we're committed to creating an inclusive and diverse workplace. We value equal opportunity and affirmative action principles, giving everyone an equal chance to succeed. We're dedicated to offering equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status. Upholding these values and adhering to applicable laws is paramount to us.
For any feedback or inquiries, please contact us at [email protected]
Learn more about us at www.foundationai.com