The goal of this website is to hold the symptoms and numbers that a brain just can't. Who can be expected to remember all the potential symptoms of the thousands of diseases that exist? Furthermore, it is equally impossible to memorize the utility of the tests designed to diagnose these diseases.

This site was designed and its data collected by a practicing Canadian internist/nephrologist. Data is from an adult population unless otherwise noted.

Details on Disease Symptom Database

This is an imperfect collection of disease symptoms. It was collected by a single reviewer, so mistakes will exist. When possible, sources that reported symptoms from primary case series were used.

Some interpretation of the primary literature was required. When acuity was not mentioned, an educated guess, based on clinical experience, was made about how the disease would be expected to present. Furthermore, some symptoms mentioned in references were not included when they were thought not to be causally associated with the disease process.

Symptom frequencies are presented as very common, common, uncommon or rare. These reflect frequencies of >50%, 11-50%, 1-10% and <1% respectively. When possible, symptom frequency data was pulled from primary case series studies. When this was not possible, symptom frequencies were estimated from review article language. In other cases, an educated guess was made based on personal clinical experience. Attempts were made to present symptom frequencies as would be expected at presentation for any given disease. Therefore, some frequencies reported in the literature were reduced to account for expected prevalence at disease presentation. Symptoms of chronic-type diseases (e.g., coronary artery disease) had their frequencies left unchanged.
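The four frequency labels amount to a simple threshold mapping. The following is a minimal Python sketch of that mapping, not the site's own code. The function name `frequency_category` is hypothetical, and the handling of values between 10% and 11% (which the stated ranges leave open) is an assumption.

```python
def frequency_category(percent: float) -> str:
    """Map a symptom frequency (percent of patients at presentation) to a label.

    Thresholds follow the stated ranges: >50%, 11-50%, 1-10%, <1%.
    Assumption: values above 10% and below 11% are treated as "common".
    """
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    if percent > 50:
        return "very common"
    if percent > 10:
        return "common"
    if percent >= 1:
        return "uncommon"
    return "rare"


if __name__ == "__main__":
    for value in (75, 30, 5, 0.5):
        print(value, "->", frequency_category(value))
```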

Details on Incidence Data Collection

Data for a general US population was preferred. Data from other locations was used when quality US data could not be found. When data was presented for multiple time periods, the most recent time period was chosen. When only prevalences could be found, incidences were calculated by roughly estimating the average disease duration. Diseases with onset in childhood or early adulthood without incidence data but with birth frequency data had incidence data estimated by taking the average US birth rate and multiplying by the birth prevalence. When possible, age- and sex-adjusted incidences were reported. When epidemiologic data could not be found, an educated guess, based on personal clinical experience, was used to roughly estimate the incidence.
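The two estimation shortcuts described above can be sketched in Python. This is an illustration only: it assumes the usual steady-state approximation (prevalence is roughly incidence times average duration), which the text implies but does not state, and the function names and example numbers are hypothetical rather than taken from the site's data.

```python
def incidence_from_prevalence(prevalence_per_100k: float,
                              mean_duration_years: float) -> float:
    """Estimate annual incidence per 100,000 from point prevalence.

    Assumes a steady state, where prevalence ~= incidence * mean duration.
    """
    if mean_duration_years <= 0:
        raise ValueError("mean_duration_years must be positive")
    return prevalence_per_100k / mean_duration_years


def incidence_from_birth_prevalence(births_per_100k_per_year: float,
                                    birth_prevalence: float) -> float:
    """Estimate annual incidence per 100,000 for a condition present at birth.

    birth_prevalence is the fraction of live births affected (e.g. 1/10000).
    """
    if not 0 <= birth_prevalence <= 1:
        raise ValueError("birth_prevalence must be a fraction between 0 and 1")
    return births_per_100k_per_year * birth_prevalence


if __name__ == "__main__":
    # Illustrative inputs only, not real epidemiologic data.
    print(incidence_from_prevalence(200, 10))             # per 100,000 per year
    print(incidence_from_birth_prevalence(1200, 1 / 10000))
```

With these illustrative inputs, a prevalence of 200 per 100,000 and a 10-year average duration gives an incidence of 20 per 100,000 per year, and a birth rate of 1,200 per 100,000 population with a birth prevalence of 1 in 10,000 gives about 0.12 per 100,000 per year.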

Details on Diagnostic Utility Data Collection

The data here is biased. The literature was not searched exhaustively, although meta-analyses, when found, were preferred over individual studies. Attempts were made to find the most recent studies and to include data from those that were strong. If a study did not report likelihood ratios (LRs) specifically, these were calculated using the Excel spreadsheet found here. If specificity or sensitivity was 100% in these studies (meaning one cell of the 2x2 table had zero patients), all cells had a value of 0.5 added to prevent a zero or infinite LR without a confidence interval. Confidence intervals are reported to reinforce that a single average value stands in for a range of plausible values, which in some cases is very wide.
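The likelihood ratio calculation from a 2x2 table, including the 0.5 correction, can be sketched in Python. This is a hedged illustration and not the spreadsheet's actual method: the function name `likelihood_ratios` is hypothetical, and the confidence interval uses the common log-scale approximation, which is an assumption since the text does not say how the intervals were derived.

```python
import math


def likelihood_ratios(tp: float, fp: float, fn: float, tn: float,
                      z: float = 1.96) -> dict:
    """Return LR+ and LR- with approximate 95% confidence intervals.

    tp, fp, fn, tn are the four cells of the 2x2 table.
    If any cell is zero, 0.5 is added to every cell (as described above).
    Assumption: intervals use the log-scale standard error approximation.
    """
    cells = [tp, fp, fn, tn]
    if any(c < 0 for c in cells):
        raise ValueError("cell counts must be non-negative")
    if any(c == 0 for c in cells):
        tp, fp, fn, tn = (c + 0.5 for c in cells)

    diseased = tp + fn   # everyone with the disease
    healthy = fp + tn    # everyone without the disease

    def ratio_with_ci(num_events, num_total, den_events, den_total):
        # Ratio of two proportions with a log-scale confidence interval.
        ratio = (num_events / num_total) / (den_events / den_total)
        se = math.sqrt(1 / num_events - 1 / num_total
                       + 1 / den_events - 1 / den_total)
        lower = math.exp(math.log(ratio) - z * se)
        upper = math.exp(math.log(ratio) + z * se)
        return ratio, lower, upper

    return {
        # LR+ = sensitivity / (1 - specificity)
        "LR+": ratio_with_ci(tp, diseased, fp, healthy),
        # LR- = (1 - sensitivity) / specificity
        "LR-": ratio_with_ci(fn, diseased, tn, healthy),
    }


if __name__ == "__main__":
    # Illustrative counts only; the second table has a zero cell.
    print(likelihood_ratios(90, 10, 10, 90))
    print(likelihood_ratios(50, 0, 10, 40))
```

In the second call, specificity is 100%, so the uncorrected LR+ would be infinite. Adding 0.5 to each cell yields a finite estimate with an interval, at the cost of a small bias.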


Please get in touch with suggestions or to report errors.