At the forefront of global genomics research, the Wellcome Sanger Institute is driving some of the most ambitious scientific programmes in the world. From decoding disease to mapping biodiversity, its work depends on the ability to process and manage vast volumes of complex biological data at speed and scale.
As the Institute expanded its Tree of Life programme, an initiative to sequence tens of thousands of species, it faced a critical challenge: how to efficiently manage a rapidly growing influx of samples without slowing down scientific progress.
Working in close partnership, NashTech helped the Institute move from fragmented manual processes to a streamlined, scalable digital platform, enabling researchers to focus on discovery, not administration.
Under its Tree of Life programme, the Wellcome Sanger Institute is leading the initiative to decipher the genomes of 72,000 living organisms found in Britain and Ireland. This huge undertaking involves receiving many thousands of samples from a wide range of partners around the country: museums, botanical gardens, research organisations, universities and more.
Rather than depending on manual spreadsheets and processes to record and track samples – each of which may have around 50 pieces of related meta-data – the Institute needed a software system that would enable them to efficiently, quickly and effectively manage the journey of each and every sample through the process, giving team members across different departments access to upto-date and real-time information whenever they need it.
We began by running a comprehensive discovery programme to work through the as-is situation with Institute team members and fully understand the desired end state.
In 10 weeks an MVP of the solution was ready for go-live. Since then we have continued to optimise and enhance specific features as needed in the spirit of continuous improvement, such as linking downstream sequence data back to each sample, as well as expanded the functionality of the system.
We have delivered a bespoke software solution to the Institute which enables team members to track every single sample received on an end-to-end basis.
Hosted in a private cloud, there is a single user interface. Written in Open Source code (Python), the open API used means that the system is easy to maintain, fully scalable and can be securely connected to the wider European and rest of the world databases as they are developed under the programme.
The speed, efficiency and accuracy of the project has been significantly increased and samples, along with their meta data, can be logged and recorded in much less time than previously.
There is much less room for error or inaccuracy now that manual processes and data entry have been removed. The Institute has a system that can easily be configured to other projects too – thus representing a strong return on investment.
“It has been a real pleasure working with NashTech. They took the time to collaborate with the team and understand what we needed, fully documenting processes and creating a detailed solutions architecture. There is no way we could have scaled to where we are, in the accelerated timescale it’s been achieved, without NashTech.”
Kenneth Haug, Enabling Platforms Team Lead – System Owner at Wellcome Sanger Institute