We are looking for a Senior Machine Learning Engineer to join our new Knowledge Enrichment team at BenchSci. You will help design and implement ML-based approaches to analyse, extract and generate knowledge from complex biomedical data such as experimental protocols and from results from several heterogeneous sources, including both publicly available data and proprietary internal data, represented in unstructured text and knowledge graphs. The data will be leveraged in order to enrich BenchSci's knowledge graph through classification, discovery of high value implicit relationships, predicting novel insights/hypotheses, and other ML techniques. You will collaborate with your team members in applying state of the art ML and graph ML/data science algorithms to this data. You are comfortable working in a team that pushes the boundaries of what is possible with cutting edge ML/AI, challenges the status quo, is laser focused on value delivery in a fail-fast environment. You Will:Analyse and manipulate a large, highly-connected biological knowledge graph constructed of data from multiple heterogeneous sources, in order to identify data enrichment opportunities and strategiesWork with data and knowledge engineering experts to design and develop knowledge enrichment approaches/strategies that can exploit data within our knowledge graphProvide solutions related to classification, clustering, more-like-this-type querying, discovery of high value implicit relationships, and making inferences across the data that can reveal novel insightsDeliver robust, scalable and production-ready ML models, with a focus on optimising performance and efficiencyArchitect and design ML solutions, from data collection and preparation, model selection, training, fine-tuning and evaluation, to deployment and monitoringCollaborate with your teammates from other functions such as product management, project management and science, as well as other engineering disciplinesSometimes provide technical leadership on Knowledge Enrichment projects that seek to use ML to enrich the data in BenchSci's Knowledge GraphWork closely with other ML engineers to ensure alignment on technical solutioning and approachesLiaise closely with stakeholders from other functions including product and scienceHelp ensure adoption of ML best practices and state of the art ML approaches at BenchSciParticipate in and sometimes lead various agile rituals and related practicesYou Have:Minimum 5, ideally 8+ years of experience working as an ML engineer in industryTechnical leadership experience, including leading 5-10 ICs on complex projects in industryDegree, preferably PhD, in Software Engineering, Computer Science, or a similar areaA proven track record of delivering complex ML projects working alongside high performing ML engineers using agile software developmentDemonstrable ML proficiency with a deep understanding of how to utilise state of the art NLP and ML techniquesMastery of several ML frameworks and libraries, with the ability to architect complex ML systems from scratch. Extensive experience with Python and PyTorchTrack record of successfully delivering robust, scalable and production-ready ML models, with a focus on optimising performance and efficiencyExperience with the full ML development lifecycle from architecture and technical design, through data collection and preparation, model selection, training, fine-tuning and evaluation, to deployment and maintenanceStrong skills related to implementing solutions leveraging Large Language Models, as well as a deep understanding of how to implement solutions using Retrieval Augmented Generation (RAG) architectureExpertise in graph machine learning (i.e. graph neural networks, graph data science) and practical applications thereof. This is complimented by your experience working with Knowledge Graphs, ideally biological, and a familiarity with biological ontologiesExperience with complex problem solving and an eye for details such as scalability and performance of a potential solutionExperience with data manipulation and processing, such as SQL, Cypher or PandasA growth mindset continuously seeking to stay up-to-date with cutting-edge advances in ML/AI, complimented by actively engaging with the ML/AI community
#J-18808-Ljbffr