You will collaborate with the CTO, the Director of Client Engineering, other developers and the analytics team to develop and implement the vision for the overall platform. The role focuses primarily on extracting structured and unstructured data from complex finance documents using our proprietary extraction framework and NLP techniques.
In addition, the role includes the following responsibilities:
- Development of proprietary NLP algorithms utilizing regex, spaCy, tries and LLMs
- Development of additional tools and interfaces to create further efficiencies and precision in our data extraction methodology
- Traditional backend work involving anything from REST APIs to database queries
Skills, Qualifications and Experience
The ideal candidate will have at least 4 years’ experience in a start-up environment or a financial institution, strong computer science fundamentals and a minimum of a Bachelor’s degree in a related field.
In particular, the candidate will have:
- Strong Python experience
- Experience with regular expressions and various NLP libraries
- Experience with one or more of the following Python libraries: PDFLib, ply.lex and/or ply.yacc
- Experience with parsing PDF and DocX document types
- Data warehouse understanding
- Experience with DevOps
- Experience with databases
- Proficiency with code versioning tools such as Git, GitHub and Bitbucket
- Experience working with common project management tools and Agile development workflows
- Ingenuity, creativity, drive and determination
- Clear communication skills
- Strong organizational skills, including the ability to respond quickly in a fast-paced environment
- Preferable but not required: experience with GraphQL