Pathway Knowledge Base: An Integrated pathway resource using BioPAX

The role of proteins and their function in pathways is crucial to understanding complex biological processes and their failures that lead to disease. With over 200 pathway databases in existence, it is not possible for biologists to examine a pathway in all of them. The emergence and adoption of Biological Pathways Exchange (BioPAX), a standardized format for exchanging pathway information, provides a unique opportunity to integrate knowledge from multiple pathway databases. We conducted a case study integrating multiple pathway databases using BioPAX and Oracle’s resource description framework (RDF) data repository. This integration enables querying across different species and across multiple pathway resources simultaneously. It also enables comparison of the degree of complementary across different pathway sources. We find that BioPAX and RDF are powerful mechanisms for data exchange and integration and are instrumental in enabling an integrated resource. The integrated dataset/s and code for our implementation in this case study available as a resource we name as the pathway knowledge base (PKB, http://pkb.stanford.edu).