
Articles
Expert-written tutorials and knowledge base.
- Biological LLM Evo 2: Getting StartedLarge language models have transformed knowledge work with a simple idea: Train a model to predict the next token using a lot of data. Similarly, the same idea is poised for a fundamental shift in biology which will open up new ways to do computational biology. However, we will first need to resolve some challenges… Read more: Biological LLM Evo 2: Getting Started
- How to download TCGA cancer dataTCGA is a valuable public database for cancer and pharmaceutical research. If you are new to this public dataset, it may not be obvious to you how to start downloading the data. In this tutorial, you will learn how to download TCGA data. What is TCGA? Before downloading the TCGA data, you must know what… Read more: How to download TCGA cancer data
- Why calibrate machine learning models?A machine learning model’s prediction for disease diagnosis is a risk score ranging from 0 to 1. We impose a decision threshold value to transform it into a Yes/No diagnosis. For example, we call cancer if the score is larger than 0.6, and non-cancer if otherwise. How well can you trust the score? In this… Read more: Why calibrate machine learning models?


