Matthew Jacobson

About

Hi! My name is Matthew Jacobson.

I am an experienced software developer and data scientist with a history of working in both a professional and research setting. The strengths I bring to my work are not particular to any specific discipline but to the field of scientific and data-driven programming itself.

Currently, I am a senior software engineer at Grafana Labs where we make an open and composable monitoring and observability stack built around Grafana, the leading open source technology for dashboards and visualization. I work with the Alerting Team to improve the alerting experience across all Grafana products (OSS, Cloud, and Enterprise).

My previous work includes:

Leading a team of engineers in MindGeek's Data Services department, working on various high-performance, low-latency APIs as well as web-scale machine-learning pipelines in the field of audio and visual fingerprinting and digital forensics.

Contracting with the University of British Columbia to develop web-applications based on research in the field of bioinformatics.

Working as a senior software developer in the Pavlidis Bioinformatics Lab to architect and implement research-oriented applications dealing in large-scale computations and/or data. Furthermore, I would both engage in neuroinformatics research and mentor staff in the benefits of applying sound development practices.

Working as an analyst and software developer in the financial sector at Pivotal Payments. My role there was to develop programmatic solutions to automate operations tasks while acting as a liaison between financial reporting and the development teams.

Projects

Web Applications

The Rare Diseases: Models & Mechanisms Network Registry GitHub

The Rare Diseases: Models & Mechanisms Network has been established to catalyze connections between people discovering new genes in patients with rare diseases, and basic scientists who can analyze equivalent genes and pathways in model organisms. Catalyst Grants fund projects that will allow rapid confirmation of potentially disease-causing genes, and fuel pilot studies to improve understanding of how specific gene mutations cause disease. It is intended that collaborations across the Canadian biomedical community will expedite the understanding of disorders, enabling the design of new therapies to the ultimate benefit of those affected by rare diseases.

GOtrack GitHub

A web-based system and database that provides access to historical records and trends in the Gene Ontology and GO annotations (GOA). GOTrack gives users access to gene- and term-level information on annotations for nine model organisms (including human) as well as an interactive tool that measures the stability of enrichment results over time for user-provided “hit lists” of genes. Capable of performing and visualizing real-time statistical analyses on large datasets.

Gemma GitHub

Gemma is a set of tools for genomics data meta-analysis, currently primarily targeted at the analysis of gene expression profiles. Gemma contains data from thousands of public studies, referencing thousands of published papers. Users can search, access and visualize coexpression and differential expression results.

ASPIREdb GitHub

ASPIREdb is a web-based software system for the analysis and visualization of genomic variants (CNVs, SNVs, and Indels) and phenotypes. Potential users include researchers who are trying to understand the relationships between discovered variants and phenotypes.

VariCarta GitHub

Web application and database providing a harmonized and comprehensive collection of of human DNA genetic variants found in individuals diagnosed with Autism Spectrum Disorder (ASD) and reported in peer-reviewed scientific literature.

MoRF GitHub

Web-based queuing system used by a computational cluster for finding fast and accurate predictions of molecular recognition features (MoRFs) in an amino acid query sequence.

IDRBind GitHub

IDRBind is a protein interface predictor for binding sites of intrinsically disordered protein regions (IDRs), ranging from short linear motifs (SLiMs) to molecular recognition features (MoRFs).

LIST GitHub

LIST predicts the deleteriousness of amino acid mutations in human proteomes. The current release, 0.9 includes precomputed predictions of ~20,000 SwissProt protein sequences identified by their UniProt accession number. Prediction scores in the range [0 .. 1], where lower scores indicate more benign and higher indicate deleteriousness effect.

LIST 2.0 GitHub

LIST 2.0 predicts the deleteriousness of amino acid mutations in human protein sequences. First, it aligns (pair-wise alignment) the query sequence to all protein sequences in the UniProt Swiss-Prot/TrEMBL database and then it identifies the most relevant homologies based on their local identity to the query sequence around the position of the target mutation. And finally, it estimates the potential deleteriousness of mutations based on Taxonomy distance of species with variations to the query.

Resume

Education

McGill University 2009 - 2013

Bachelor of Science

Joint Honours Mathematics & Physics

Marianopolis College 2007 - 2009

Diploma of College Studies

Pure & Applied Science

Work Experience

Grafana Labs 2022 - current

Senior Software Engineer

Develop primarily in GO with the Backend Alerting Team to improve the alerting experience across all Grafana products (OSS, Cloud, and Enterprise).
Provide on-call support and expert assistance in Alerting for internal systems and external customers.

MindGeek - Data Services

Software Development Manager 2021 - 2022

Lead Software Developer 2020 - 2021

Senior Software Developer 2019 - 2020

Develop new features for existing projects handling hundreds of billions of events daily.
Assist junior members of the team when needed.
Participate in the full software development life cycle (design, implementation, integration, and interaction with quality assurance).
Write well designed, testable, efficient code.
Provide expert advice and assistance to other teams in the company.

Contract Software Developer 2018 - 2022

Software Developer

Develop web applications that implement research-driven computational solutions.
Deliver full-stack solutions from systems architecture to database design to backend and frontend layers.

Centre for High-Throughput Biology, UBC 2014 - 2018

Software Developer

Lead development of new applications.
Architect web-applications and infrastructure in the field of bioinformatics.
Design and assess computational methods, networks, databases and tools.
Manage multiple replicate database servers, CI servers, and web servers.
Engage in neuroinformatics research.
Mentor staff and students in the benefits of sound development practices.

Pivotal Payments 2008 - 2011

Analyst & Developer

Liaison between financial reporting and IT, developing programmatic solutions to streamline repeated operations tasks and automate financial reporting systems.
Improved pipelines for internal billing of U.S. merchants, merchant pricing in new acquisitions and HR benefits programs.

Papers

Curation of over 10,000 transcriptomic studies to enable data reuse

N Lim, S Tesar, M Belmadani, G Poirier-Morency, BO Mancarci, J Sicherman, M Jacobson, et al.

PREPRINT; bioRxiv 2020.07.13.201442
LIST-S2: taxonomy based sorting of deleterious missense mutations across species

N Malhis, M Jacobson, SJM Jones, J Gsponer

Nucleic acids research 48.W1 (July 2020) W154-161, 2020
Whole genome sequencing and variant discovery in the ASPIRE autism spectrum disorder cohort

DB Callaghan, S Rogic, PPC Tan, K Calli, Y Qiao, R Baldwin, M Jacobson, et al.

Clinical genetics (2019). Wiley Online Library, 2019
VariCarta: A comprehensive database of harmonized genomic variants found in autism spectrum disorder sequencing studies

M Belmadani, M Jacobson, N Holmes, M Phan, T Nguyen, P Pavlidis, S Rogic

Autism Research 12.12 (2019) pp. 1728–1736. 2019
Monitoring changes in the Gene Ontology and their impact on genomic data analysis

M Jacobson, AE Sedeño-Cortés, P Pavlidis

Gigascience 7.8 (Aug. 2018). 2018
Interactive Exploration, Analysis, and Visualization of Complex Phenome–Genome Datasets with ASPIREdb

PPC Tan, S Rogic, A Zoubarev, C McDonald, F Lui, G Charathsandran, M Jacobson, et al.

Human mutation 37.8 (2016) pp. 719–726. Wiley Online Library, 2016
MoRFchibi SYSTEM: software tools for the identification of MoRFs in protein sequences

N Malhis, M Jacobson, J Gsponer

Nucleic acids research 44.W1 (July 2016) W488-493, 2016