Senior Data Scientist
About Us:
Foundation AI is the only AI-native document intake automation platform serving the claims and litigation industries. Founded in 2019 by a team of lawyers and data scientists, Foundation AI processes millions of documents each month for hundreds of US law firms, including many of the largest and most respected plaintiff and injury law firms in the country. Find out more at www.foundationai.com.
Job Overview:
As a Senior Data Scientist, you'll play a pivotal role in researching, developing, and enhancing algorithms that empower computers to learn from text. Your primary focus will be on improving accuracy, reducing costs, and enhancing efficiency through the development of cutting-edge NLP capabilities. By harnessing the power of structured and unstructured data, you'll contribute to the advancement of AI-driven solutions aimed at revolutionizing our client services.
Key Responsibilities:
- Use structured and unstructured data (text and images) to develop AI and ML models.
- Conduct research on emerging AI and ML techniques, focusing on NLP.
- Hands-on experience with advanced models such as Transformers (BERT, RoBERTa), CNN, LSTM, XGBoost, GPT, YOLO, etc.
- Experience in fine-tuning LLMs for tasks such as Classification, Information Extraction, and Document Summarization.
- Optimize model performance using techniques such as Quantization, Pruning, LoRA, PEFT, etc.
- Ensure data integrity through processing, cleansing, and verification.
- Apply pattern recognition, knowledge graph, classification, and regression techniques to solve complex problems.
- 3-6 years of hands-on experience in data science, with a strong understanding of Machine Learning principles.
- Familiarity with Deep Learning frameworks like TensorFlow and PyTorch.
- Experience implementing LLMs in production at scale.
- Strong prompt engineering and optimisation skills.
- Experience using LLM tools like LangChain, LlamaIndex, etc.
- Excellent programming skills in Python, with additional experience in R, Go, or Scala as well as strong SQL skills for data manipulation.
- Experience building ML/DL models at scale and proficiency in Git.
- Knowledge of Docker/Kubernetes, Web Application Development, PySpark, and Big Data technology is advantageous.
- A proactive attitude and a passion for making a meaningful impact.
Responsibilities may be tailored based on the candidate’s experience and proficiency.
Skills and Tools:
- Minimum 3 years of experience in NLP, including text classification, information extraction, and named entity recognition.
- Proficiency with BERT family models (BERT, RoBERTa, DistilBERT).
- Experience consuming/integrating LLMs (Open Source/Commercial) such as Mistral, LLaMA, T5, Falcon, Vicuna, MPT, OpenAI, Claude, and Bard.
- Strong experience with TensorFlow and PyTorch.
- Expertise in Python programming.
- Working knowledge of PostgreSQL, Docker, and Git.
- Demonstrated experience deploying NLP models in production environments.
- Experience working with AWS Cloud Services.
- Experience working with LLM frameworks and Prompt engineering.
Education:
Bachelor's or Master's degree in Computer Science, Electrical Engineering, Statistics, or an equivalent field.
Our Commitment:
Foundation AI is an equal opportunity employer committed to diversity and inclusion in the workplace. We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic. Our hiring decisions are based solely on qualifications, merit, and business needs at the time.
For any feedback or inquiries, please contact us at [email protected]
Learn more about us at www.foundationai.com