Building the Semantic Web at NASA: People, Organizations, Projects, and Skills
This presentation, which is aimed at executives and managers, concentrating on the “big picture”, describes the data problem in NASA and some of the ways NASA is using Semantic Web technologies to tackle the data problem.
NASA has a data problem
Size
There’s a practically unimaginable amount of NASA data: unknown terabytes of data across disparate sites and in every data format possible.
Scale
- structured, unstructured, semistructured
- sensor and experimental data output from every conceivable kind of device, including thousands of bespoke or one off devices
- metadata embedded in mountains of imagery (photographs, video, etc.)
- all the usual “enterprise data” associated with an organization of NASA’s size and scope
- data about people, their skills, and ongoing projects, including teams across dozens of sites, thousands of experts, and hundreds of thousands of existing and historical projects
- it’s very complex and needs to be integrated in many ways.
For example, when planning to return to the Moon, NASA analysts have to use lunar planetology data to find potential landing sites. But what makes a landing site suitable is a complex question, involving seemingly (but not really) disparate data about lunar topography and geology, but also about people, their skills, historic projects, sensor data, as well as mission goals under various scenarios.
Importance
NASA’s data matters: it’s vital to helping humanity understand its place in the universe. Few human projects matter more.
Diversity
NASA’s data is very public and belongs not only to the US taxpayer but, also, in some sense, to the world. I18N issues matter, as do accessibility issues.
NASA has been working on its data problem
- Metadata & KM councils and best practice groups
- Catalogs, libraries, dictionaries of data collections
- Hence, lots of well-curated data collections
- Scientific and resarch communities with big stakes in getting things right
- But much of this information isn’t Web-accessible or machine-readable or sufficiently expressive
NASA’s Using Semantic Web technologies
- RDF to manage agile data integration
- OWL ontologies to describe and reason about complex planetary data
- Best-of-breed Semantic Web tools like mSpace to manage complex knowledge
POPS: A Case Study
- Live demo of POPS (People, Organizations, Projects, Skils)
- Starting modestly to build attractor services
- Loosely coupled with existing enterprise architecture
- Deployment as simple web service
- Browse, search, then query
- Using mSpace to allow users to navigate POPS data to discover novel connections and relationships
- Social network visualizer in JSpace, a Java clone of mSpace
What’s Next?
- More RDF-based data integration
- Add search and ad hoc query (adopting SPARQL)
- More OWL to describe and reason about more complex knowledge domains
- Refactoring success stories into infrastructure services (search, query, OWL reasoning)




