Title: Senior Data Scientist
Type: Full Time
Location: LATAM/EU (remote)
About Patexia:
Patexia is a forward-thinking technology company specializing in intellectual property and patent solutions. We are looking for a Senior Data Scientist to join our remote team. In this role, you will design and implement state-of-the-art ranking and similarity scores tailored to the legal sector, leveraging advanced NLP tools and AI technologies to address complex challenges in patent data analysis and entity resolution.
Key Responsibilities:
- Design and develop state-of-the-art ranking and similarity scoring algorithms for the legal sector, utilizing advanced NLP and AI techniques.
- Lead prompt engineering initiatives, leveraging OpenAI's ChatGPT and other platforms, to tackle intricate text processing challenges effectively.
- Utilize your expertise in NLP to drive innovation in patent data analysis, entity resolution, and other related areas.
- Collaborate with cross-functional teams to understand project requirements, devise innovative data science solutions, and ensure successful project outcomes.
- Transition seamlessly between roles of Data Scientist and Data Analyst based on project needs.
- Communicate complex data science concepts clearly and effectively to both technical and non-technical stakeholders.
- Stay up-to-date with the latest advancements in NLP and AI, and evaluate their relevance and potential application to ongoing projects.
- Document solutions thoroughly and contribute to knowledge sharing within the team.
Qualifications & Skills:
Must-Have:
- Minimum of 5 years of experience as a Data Scientist.
- Proven experience in prompt engineering with a strong emphasis on OpenAI's ChatGPT, LLaMA, and Langchain.
- Deep understanding of NLP concepts, methodologies, and their applications such as NER, BERT, Transformers, and LSTM.
- Deep understanding of ML basics such as Regression Analysis, Clustering, Feature Engineering.
- Expertise in designing and implementing machine learning models
- Strong analytical abilities and problem-solving skills.
- Self-motivated and capable of working independently and collaboratively within a remote team environment.
- Experience with BigQuery Machine Learning (BQ ML), Vector databases, Vector Indexing, and Vector Search.