Semantic Web

Semantic Web, from web of documents to web of data. Source: Fensel 2013, slide 15.
Semantic Web, from web of documents to web of data. Source: Fensel 2013, slide 15.

In the original web of the 1990s information was shared as webpages or documents that could be understood by humans. Via hyperlinks, these linked to other webpages or documents anywhere on the web. Servers and desktop computers processed or displayed all this information but didn't understand them. For example, a computer could tell that a particular text is a heading or another text is in italics. It didn't know that the heading is actually the title of a blog post or that the text in italics is the author's name.

Semantic Web is an attempt to describe and link web content in a manner that's meaningful to machines. Semantic Web extends the original web. Semantic Web wants to transform the web from a "web of documents" into a "web of data".

Discussion

  • What's the motivation behind Semantic Web?

    By enabling machines to understand data, we can benefit in many ways:

    • Automation: We can avoid doing mundane stuff such as booking tickets or rescheduling appointments. These can be efficiently handled by virtual assistants or agents.
    • Personalization: Content on the web is growing daily. It's impossible for us to follow everything. Agents can personalize or curate content for us.
    • Information Retrieval: Within enterprises or via web search engines, Semantic Web can give us more relevant answers.
    • Data Reuse: Because Semantic Web enables linking of data from a variety of sources, data can be reused. Data that was previously stored in isolated databases can now be shared in a standard manner.
    • Knowledge Discovery: By linking data across the web, new knowledge can be discovered. Semantic Web enables machines to apply logic on existing relationships and infer new ones. For example, this could be useful in discovering new drugs.
  • With the progress of Machine Learning (ML), is Semantic Web relevant?
    Difference between Semantic Web and Data Science. Source: Lampa 2018, slide 3.
    Difference between Semantic Web and Data Science. Source: Lampa 2018, slide 3.

    Semantic Web doesn't make machines intelligent in the sense of Artificial Intelligence or Machine Learning. Instead of asking machines to understand humans, we help machines to solve well-defined problems on well-defined data via well-defined operations.

    It therefore does appear that AI/ML has gone ahead and enabled machines to see, hear and speak. The mid-2010s have seen the arrival of voice assistants, chatbots, computer vision applications, and more. This has been possible because of the availability of data to train ML algorithms. Though some ML algorithms require labelled data for training, there's no need to add semantic metadata to all data.

    However, Semantic Web complements and even overlaps with AI/ML. Semantic Web can help with explainable AI. Natural Language Processing (NLP) is an application area with chatbots and intelligent assistants using semantically linked data. Semantic Web can add background knowledge to AI/ML systems, particularly in areas where data is scarce. We're also seeing AI/ML being applied to conceptualize domain knowledge for the Semantic Web. AI can fill in missing data or clean noisy data in knowledge graphs.

  • What are the basic building blocks of Semantic Web?
    The Semantic Web Stack (aka Semantic Web Cake). Source: Idehen 2017.
    The Semantic Web Stack (aka Semantic Web Cake). Source: Idehen 2017.

    Semantic Web builds upon the foundations of the original web. Data (old and new) must be described with metadata. This metadata will identify data, interlink data and relate data to concepts so that machines can understand them.

    Data must be uniquely identified and this is done using Uniform Resource Identifier (URI) or Internationalized Resource Identifier (IRI). Resource Description Framework (RDF) provides the data model. Meaning is added at a higher layer with what we call ontologies. In other words, RDF specifies the syntax while ontologies specify the semantics.

    Just as HTML was the building block of the original web, RDF is the building block of the Semantic Web. Web content can expose their semantics by embedding RDF statements within webpages. There are many ways to do this: RDFa, RDF-XML, RDF-JSON, JSON-LD, Microdata, etc. Semantic data already processed and stored in RDF format can be queried. Just as MySQL exists to query relational databases, SPARQL is a language to query RDF stores. Given the semantics, rules can help in applying logic and reasoning.

  • What is Resource Description Framework (RDF)?
    RDF Triple: Subject, Predicate, Object. Source: Herrmann 2011.
    RDF Triple: Subject, Predicate, Object. Source: Herrmann 2011.

    As the name suggests, RDF helps us describe any resource so long as that resource has a unique identifier. In other words, RDF helps us define data about other data, that is, the metadata.

    RDF has three components: subject, predicate, object. It's a statement about the relationship between the subject and the object. Thus, the fact that Villa Nellcôte is located in France can be expressed as an RDF Triple. All three parts of the triple are expressed as URIs, literals or blank nodes.

    When we combine many such statements together, we get what is called an RDF Graph. Subjects and objects are nodes of the graph. Predicates form the connecting arcs. For example, we can state that France is in Europe, Paris is the capital of France, Paris has a population of 2.2 million... Each of these can be expressed as an RDF Triple. Collectively, they form an RDF Graph.

  • How do we add meaning to data on the web?
    Semantic Web adds links data on the web and adds a layer of meaning to data. Source: Petkova 2016.
    Semantic Web adds links data on the web and adds a layer of meaning to data. Source: Petkova 2016.

    RDF on its own doesn't give meaning to data. RDF is a data model, a method to express relationships. To give meaning, vocabularies and ontologies are defined. These are typically written in terms of classes, their properties and relationships to other classes.

    For example, an RDF triple can express that Paris is the capital of France but for a computer this still makes no sense. A vocabulary can define that capital is a type of city, city belongs to a country, and country is a political entity. This helps the computer to get a sense of the context though it can never truly understand the way humans do.

    RDF Schema (RDFS) is a simple vocabulary while Web Ontology Language (OWL) is more powerful.

  • Could you explain the term ontology?
    What is ontology? Source: SpryKnowledge 2011.

    Ontology has a metaphysical meaning but in computer science it refers to a formal description of knowledge. Concepts and their relationships within a specific domain are described. Classes, attributes, and relations such as restrictions, rules and axioms are defined. These represent the knowledge of that domain.

    For example, in the domain of education, we can define that a course is taught by an academic staff member; academic staff member is a subclass of staff member; the union of staff member and student is all people at the university; academic staff member is equivalent to faculty; and so on. If we state that John Smith teaches Chemistry 101, using the ontology we can infer that John Smith is a staff member.

    Data coming from different sources can rely on ontologies for a common understanding. Ontologies can also help in finding logical inconsistencies, classes that cannot have instances, or different instances that share the same names. One important benefit is that logic and reasoning can be applied to discover new knowledge.

  • Could you describe some real-world applications of Semantic Web?

    It was reported in 2007 that the oil and gas industry is using RDF/OWL to combine data from diverse sources, and standardize data exchange, sharing and integration across partners or applications. Collaborative knowledge management also became possible.

    In April 2010, Facebook launched Open Graph that web publishers can use to integrate their web pages into Facebook's social graph. This enables Facebook to understand what a user likes, give personalized recommendations, or connect users with similar interests. A simplified form of RDFa was adopted.

    During the FIFA World Cup of 2010, BBC website used semantic web technologies to dynamically display content. SPARQL queries and OWL 2 RL reasoning were employed. With the success of this project, in January 2013, BBC committed to the development of Linked Data Platform to enable dynamic semantic publishing. BBC's Music site from 2008 was also an early example of using the semantic web.

    Other examples include causality mining in pharma, semantic web mining, mining health records for insights, and fraud detection. You may also look up W3C's page titled Semantic Web Case Studies and Use Cases for more examples.

Milestones

1997

W3C produces the first working draft of RDF Model and Syntax. This evolves to a 2004 W3C Recommendation titled Resource Description Framework (RDF): Concepts and Abstract Syntax.

1998

A working draft of RDF Schema (RDFS) is published by W3C. Since RDF itself lacks means to define semantics, RDFS provides a basic type system for use in RDF models. RDF Schema 1.1 is published in February 2014 as a W3C Recommendation.

May
2001

American science magazine Scientific American publishes an article by the inventor of the web, Tim Berners-Lee. Titled The Semantic Web, this article talks about web content that's meaningful to computers. The article narrates futuristic interactions enabled by the semantic web. It discusses knowledge representations, ontologies and agents. Though Berners-Lee had discussed some of these as early as 1994, it's only with this article that the vision of semantic web reaches a wide audience.

2004

W3C publishes Web Ontology Language (OWL). OWL enables complex relationships that are not possible with RDFS. A revision of this (often called "OWL 2") is published in 2009. A second edition of OWL 2 is published in 2012.

2006

Tim Berners-Lee points out that putting data out on the web isn't enough. Data has to be linked to other related data. He proposes what he calls Linked Open Data (LOD) and defines a 5-star system to grade how well people share data. He also states that,

Linked Data is the Semantic Web done right.
Jan
2007

DBpedia is created by using structured data available in Wikipedia and representing them as RDF triples. By using a query language such as SPARQL, DBpedia can be used to make semantic queries. As on April 2016, DBpedia contains 9.5 billion RDF triples.

2010

This may be the year when many companies adopt semantic web technologies for commercial use. Examples include Best Buy, BBC World Cup site, Google, Facebook and Flipboard.

Jun
2011

Google, Microsoft, Yahoo and Yandex agree on Schema.org, a vocabulary for associating meaning to data on the web. The vocabulary is defined by a community process. Schema.org can be used with various encodings including RDFa, Microdata and JSON-LD. Google itself recommends the use of Schema.org along with JSON-LD.

Oct
2011

Apple integrates voice assistant Siri into some of its products. Within the next couple of years, competitive voice assistants Microsoft Cortana, Google Now and Amazon Alexa are released. These are largely powered by machine learning techniques but may also leverage the semantic web technologies.

Sep
2013

Google Search gets the Hummingbird update, which enables the search engine to figure out user's intent. This is beyond searching by keywords alone. This capability comes from semantics attached to data plus Google's Knowledge Graph. Knowledge Graph was announced in May 2012.

2021

Given that 2001 is considered the birth year of Semantic Web (with the publication of the Scientific American article), Semantic Web celebrates its 20th anniversary. In an article, Hitzler writes that the grand goal of data sharing, discovery, integration and reuse hasn't been achieved. Every sub-field of Semantic Web needs further progress. Given many varied approaches, some application-oriented consolidation is needed with good interworking of tools and well-documented processes.

Sample Code

  • <!-- 
    Example of using Schema.org encode as JSON-LD
    Source: https://developers.google.com/search/docs/guides/intro-structured-data
    -->
    <script type="application/ld+json">
    {
      "@context": "http://schema.org",
      "@type": "Organization",
      "url": "http://www.example.com",
      "name": "Unlimited Ball Bearings Corp.",
      "contactPoint": {
        "@type": "ContactPoint",
        "telephone": "+1-401-555-1212",
        "contactType": "Customer service"
      }
    }
    </script>
     

References

  1. Berners-Lee, Tim. 1998. " What the Semantic Web can represent." Design Issues, W3C. Accessed 2018-05-27.
  2. Berners-Lee, Tim. 2006. "Linked Data." W3C, July 27. Accessed 2018-05-27.
  3. Berners-Lee, Tim, James Hendler, and Ora Lassila. 2001. "The Semantic Web." Scientific American, May. Accessed 2018-05-27.
  4. Bohn, Dieter. 2016. "Google Home: a speaker to finally take on the Amazon Echo." The Verge, May 18. Accessed 2017-05-27.
  5. Brickley, Dan, and R.V. Guha. 2014. "RDF Schema 1.1." W3C Recommendation, February 25. Updated 2017-10-02. Accessed 2018-05-27.
  6. Brickley, Dan, R.V. Guha, and Andrew Layman. 1998. "Resource Description Framework (RDF) Schema Specification." W3C Working Draft, October 30. Updated 2017-10-02. Accessed 2018-05-27.
  7. Cabeda, José. 2017. "Semantic Web is Dead, Long live the AI!!!" Hackernoon, June 4. Accessed 2017-05-27.
  8. Chum, Frank. 2007. "Use Case: Ontology-Driven Information Integration and Delivery: A Survey of Semantic Web Technology in the Oil and Gas Industry." W3C, April. Accessed 2018-05-28.
  9. DBpedia Wiki. 2016. "DBpedia version 2016-04." April. Accessed 2018-05-27.
  10. DUNAVTECH. 2010. "History and Context." Semantic Web, DUNAVTECH, August 18. Accessed 2018-05-26.
  11. Dormehl, Luke. 2017. "Today in Apple history: Siri debuts on iPhone 4s." Cult of Mac, October 4. Accessed 2018-05-27.
  12. Duerst, M. and M. Suignard. 2005. "Internationalized Resource Identifiers (IRIs)." RFC 3987, January. Accessed 2018-12-01.
  13. Feigenbaum, Lee, Ivan Herman, Tonya Hongsermeier, Eric Neumann and Susie Stephens. 2007. "The Semantic Web in Action." Scientific American, December, pp. 90-97. Accessed 2020-05-27.
  14. Fensel, Anna. 2013. "The Semantic Web Exists. What Next?" Conquering Data Workshop, STI Innsbruck, University of Innsbruck, 17 October. Accessed 2021-01-27.
  15. Fox, Vanessa. 2011. "Schema.org: Google, Bing & Yahoo Unite To Make Search Listings Richer Through Structured Data." Search Engine Land, June 2. Accessed 2018-05-25.
  16. Google Developers. 2017. "Introduction to Structured Data." September 13. Accessed 2018-05-27.
  17. Heath, Tom. 2009. "Linked Data? Web of Data? Semantic Web? WTF?" Tom Heath's Displacement Activities (blog), March 9. Accessed 2018-05-27.
  18. Herrmann, Joseph. 2011. "The Semantic Web." Object Computing, Inc, February. Accessed 2018-05-27.
  19. Hitzler, Pascal. 2021. "A Review of the Semantic Web Field." Communications of the ACM, vol. 64, no. 2, pp. 76-83, February. Accessed 2021-01-27.
  20. Hoehndorf, Roberta and Núriac Queralt-Rosinach. 2017. "Data Science and symbolic AI: Synergies, challenges and opportunities." Data Science, vol. 1, no. 1-2, pp. 27-38. Accessed 2017-05-27.
  21. Hosch, William L. 2009. "Semantic Web." Encyclopædia Britannica, August 06. Accessed 2018-05-26.
  22. Idehen, Kingsley Uyi. 2017. "Semantic Web Layer Cake Tweak, Explained." OpenLink Software, via Medium, July 14. Accessed 2018-05-25.
  23. Klyne, Graham and Jeremy J. Carroll. 2004. "Resource Description Framework (RDF): Concepts and Abstract Syntax." W3C, February 10. Accessed 2018-05-26.
  24. Lampa, Samuel. 2018. "Semantic Web ❤ Data Science? - Practical large scale semantic data handling with RDFIO and RDF-HDT." SlideShare, April 10. Accessed 2017-05-27.
  25. Macmanus, Richard. 2010a. "Facebook & The Semantic Web." ReadWrite, July 1. Accessed 2018-05-28.
  26. Macmanus, Richard. 2010b. "Top 10 Semantic Web Products of 2010." ReadWrite, December 29. Accessed 2018-05-28.
  27. Moz. 2018. "Google Hummingbird." Moz, May 25. Accessed 2018-05-27.
  28. Mutchler, Ava. 2017. "Voice Assistant Timeline: A Short History of the Voice Revolution." Voicebot.ai, July 14. Accessed 2018-05-27.
  29. Ontotext. 2015. "BBC Uses Semantic Publishing to Power the FIFA World Cup Web Site." August. Accessed 2018-05-28.
  30. Ontotext. 2018. "What are Ontologies?" March 23. Accessed 2018-05-27.
  31. Paschke, Adrian. 2013. "The Nature of Information." SlideShare, November 14. Accessed 2017-05-27.
  32. Petkova, Teodora. 2016. "A Web of People and Machines: W3C Semantic Web Standards." Ontotext, March 24. Accessed 2018-05-25.
  33. Schema.org. 2018. "Welcome to Schema.org." Accessed 2018-05-27.
  34. Schreiber, Guus, and Yves Raimond. 2014. "RDF 1.1 Primer." W3C Working Group Note, June 24. Accessed 2018-05-27.
  35. Scott, Tom. 2008. "The all new BBC music site where programmes meet music and the semantic web." Derivadow, July 28. Accessed 2018-05-28.
  36. Singhal, Amit. 2012. "Introducing the Knowledge Graph: things, not strings." Google Blog, May 16. Accessed 2018-05-25.
  37. SpryKnowledge. 2011. "What is an Ontology." YouTube, April 4. Accessed 2018-05-26.
  38. Sweeney, Peter. 2016. "The History of the Semantic Web is the Future of Intelligent Assistants." Medium, August 20. Accessed 2018-05-27.
  39. W3C Wiki. 2012. "Web Ontology Language (OWL)." W3C, December 11. Accessed 2017-05-27.
  40. Wikipedia. 2018. "DBpedia." Wikipedia, May 6. Accessed 2018-05-27.
  41. eTutorials.org. 2018. "Practical resource description framework (rdf)." Accessed 2018-05-26.

Further Reading

  1. Berners-Lee, Tim, James Hendler, and Ora Lassila. 2001. "The Semantic Web." Scientific American, May. Accessed 2018-05-27.
  2. Idehen, Kingsley Uyi. 2017. "Semantic Web Layer Cake Tweak, Explained." OpenLink Software, via Medium, July 14. Accessed 2018-05-25.
  3. Schreiber, Guus, and Yves Raimond. 2014. "RDF 1.1 Primer." W3C Working Group Note, June 24. Accessed 2018-05-27.
  4. eTutorials.org. 2018. "Practical resource description framework (rdf)." Accessed 2018-05-26.
  5. Paschke, Adrian. 2013. "The Nature of Information." SlideShare, November 14. Accessed 2017-05-27.
  6. Singhal, Amit. 2012. "Introducing the Knowledge Graph: things, not strings." Google Blog, May 16. Accessed 2018-05-25.

Article Stats

Author-wise Stats for Article Edits

Author
No. of Edits
No. of Chats
DevCoins
4
1
4274
3
3
149
1
1
32
1903
Words
38
Likes
58K
Hits

Cite As

Devopedia. 2022. "Semantic Web." Version 8, February 15. Accessed 2023-11-12. https://devopedia.org/semantic-web
Contributed by
3 authors


Last updated on
2022-02-15 11:51:02
  • Semantic Search
  • Semantic Web for Developers
  • Linked Open Data
  • Web Ontology Language
  • Resource Description Framework
  • Social Web