Scott Walmsley Exploring big data in a molecular world

Capabilities

Former molecular biologist turned informaticist and computer programmer. I currently perform informatic artistry in the world of molecular data analysis combining my right brain creative influx with my left brain discipline.

Database

Access to and storage of data is paramount for analyzing data. My current toolbox includes experience in SQL, NoSQL (document stores), and Graph databases. Graph QL is a recent addition.

Code

Core skills in any informatic scheme include being able to code. After enrolling in Apple's Computers for Kids (BASIC, ca 1980), experience was gained in object oriented languages in support of analytic and algorithmic tasks.

Analytics

Analytic techniques require scripting ability and ways to visualize data. R and Python are the main recipies for data analytics and prototyping methods. Javascript, D3.js, RStudio and Shiny are the canvas for presentation.

Featured Projects

mountains

HMDB molecular network amalgamation

Building of a graph database with Neo4J and MongoDB using Java, R, and D3.js. Metabolic pathway relationships were built using SMPDB with visualizations provided on the basis of chemical classifications. The tool is designed to extract sub-networks determined by expression data.

mountains

Rapid Lewis formula identification from static databases using isotopic distributions

Using theoretical isotope distrubutions of chemical formulae, we assembled a self correcting search algorithm via a global model method to improve accuracy and speed of molecular formula assignment.

Check it out