SlideShare a Scribd company logo
Università degli studi di Bari “Aldo Moro”
                              Dipartimento di Informatica




             ConNeKTion: A Tool for Exploiting Conceptual
               Graphs Automatically Learned from Text
                                F. Leuzzi, S. Ferilli, F. Rotella
L.A.C.A.M.                 {fabio.leuzzi, stefano.ferilli, fulvio.rotella}@uniba.it
http://lacam.di.uniba.it

                           9th Italian Research Conference on Digital Libraries
                                    Università la Sapienza - Rome, Italy
                                       January 31 - February 1, 2013
Overview

             ●   Introduction & Objectives
             ●   Tool overview
             ●   Knowledge Representation Formalism
             ●   Relevant concepts
             ●   Information Retrieval
             ●   Reasoning by Association
             ●   Exploiting Tool
             ●   Conclusions & Future Works



ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     2
Introduction
      Some repositories leave the responsibility of quality to the authors.


                                                  +
                   Anybody can produce and distribute documents.


                                                  =
              Possible low average quality of the repository contents.




The study, understanding and exploitation of the content of a digital library,
with the aim to easily explore the semantic content of huge amounts of text.



ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     3
Introduction
         Possible solution:
 ●       Natural Language Processing systems
     ●    Provide the grammatical structures contained in text
 ●       Knowledge Representation formalisms
     ●    Semantic networks
 ●       Graph learning techniques
     ●    To obtain a semantic network starting from the text
 ●       In order to satisfy the information needs, the knowledge base
         can be exploited:
     ●    To make summarizations
     ●    To reason with it
     ●    ...

ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     4
Objectives

                                 Improving fruition of a DL


         ●       Use of a tool providing advanced functionalities
             ●       Mixed strategy for relevant concept recognition
             ●       Semantic approach to information retrieval
             ●       Automatic inference over the acquired knowledge
                 ●    Reasoning by association




ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     5
Tool overview




ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     6
Knowledge representation
                       formalism
    Only subject, verb and complement have been considered.
    ●   Subjects and complements → concepts
    ●   Verbs → relations between them




               subject,                                    subject,
                verb,                                    complement
             complement



    The frequency of arcs between the concepts in positive and negative
    sentences has been taken into account.
    ●   Enrich the representation formalism
    ●   Give robustness to our solution through a statistical approach
ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     7
Relevant Concepts

   ● Relevant nodes are sought in the graph

   ● Mixed strategy
      ●       Semantic network structure
      ●       EM clustering provided by Weka
      ●       Keyword Extraction
          ●    Quantitative approach based on co-occurrences
          ●    Qualitative approach exploiting WordNet
          ●    Psychological approach based on principles of an effective
               presentation
   ● Components empirically weighted



ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     8
Information Retrieval
   ●       Word Sense Disambiguation
       ●       One Domain per Discourse assumption: many uses of a word in a
               coherent portion of text tend to share the same domain
       ●       Prevalent domain individuation
           ●       Extraction of all synsets for each term
           ●       Extraction of all domains for each synset
           ●       Choice of prevalent domain synset
   ●       Pairwise Complete Link Agglomerative Clustering
       ●       Each synset generates a singleton cluster
       ●       For each pair of clusters
           ●       If the complete link property holds
               ●   Merge the involved clusters



ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     9
Information Retrieval
   ●       Multi-strategy Similarity Measure on WordNet
   ●       3 components summed and normalized in ]0,1[
       ●       depth (ancestors)
       ●       breadth (direct neighbors)
       ●       breadth (inverse neighbors)
   ●       Document Partitioning
       ●       For each document
           ●    Each synset votes for a cluster
   ●       User Query Processing
       ●       Brute force WSD to find the best synsets combination
       ●       Best combination used to return a ranked list of clusters
       ●       Each cluster has a list of related documents obtained by the Document
               Partitioning phase


ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                      10
Reasoning ‘by association’
                     Breadth-First Search
         Given two nodes (concepts), a Breadth-First Search starts from
         both nodes, the former searches the latter's frontier and vice
         versa, until the two frontiers meet. Then the path is restored
         going backward to the roots in both directions.




         We also provide the number of positive/negative instances, and
         the corresponding ratios over the total to help understanding
         different gradations (permitted, prohibited, typical, rare, etc.) of
         actions between two objects.

ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     11
Reasoning ‘by association’
                     Breadth-First Search
   The table below shows a sample of possible outcomes.
   E.g., an interpretation of case 1 can be:

   “the young looks television that talks about (and criticizes) facebook,
   because it typically does not help (rather distracts) schoolwork”.




ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     12
Reasoning ‘by association’
                   Probabilistic approach
  Real world data are typically noisy
  and uncertain → need for strategies
  that soften the classical rigid logical
  reasoning



  Defined a formalism based on ProbLog language: pi :: fi
    ●   fi : ground literal of the form link (subject, verb, complement)
    ●   pi : ratio between the
        sum of all examples for
        which fi holds and the sum
        of all possible links between
        subject and complement

ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     13
ConNeKTion




ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     14
ConNeKTion




ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     15
ConNeKTion




ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     16
ConNeKTion




ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     17
ConNeKTion




ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     18
Conclusions

  ConNeKTion allows to learn conceptual graphs from plain text and to
  enrich them by finding concept generalizations.


  The resulting graph can be used for several purposes:
  ●   finding relationships between concepts (if any)
  ●   filtering the concepts from a particular perspective
  ●   relevant concepts recognition and information retrieval


  A suitable control panel is provided for the user to comfortably carry out
  these activities.



ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                     19
Future Works
  We plan to improve the natural language text pre-processing using anaphora
  resolution in order to replace, where possible, pronouns with the explicit concept
  they express.


  All functionalities have parameters set empirically. A criteria for automatical
  setting of suitable parameters is needed.


  The preseted functionalities are based on the exploitation of WordNet. A strategy
  to make the operators WordNet free can be desirable.


  We also wish to extend the reasoning operators by adding an argumentation
  operator, that could exploit probabilistic weights, intended as a rate of reliability,
  to provide support or attack to a given statement.


ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text
F. Leuzzi, S. Ferilli, F. Rotella                                                      20

More Related Content

PDF
Using ontology for natural language processing
PPTX
Use of ontologies in natural language processing
PDF
Statistics-based Approaches to Lexical Semantics
PDF
Ph.D. Defense Presentation of "Open-Domain Word-Level Interpretation of Norwe...
PDF
10.1.1.35.8376
PDF
Common Logic: An Evolutionary Tale
PDF
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
PDF
Improving Robustness and Flexibility of Concept Taxonomy Learning from Text
Using ontology for natural language processing
Use of ontologies in natural language processing
Statistics-based Approaches to Lexical Semantics
Ph.D. Defense Presentation of "Open-Domain Word-Level Interpretation of Norwe...
10.1.1.35.8376
Common Logic: An Evolutionary Tale
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
Improving Robustness and Flexibility of Concept Taxonomy Learning from Text

What's hot (11)

PDF
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
PDF
textTOvec: Deep Contextualized Neural Autoregressive Topic Models of Language...
PDF
Pankaj Gupta CV / Resume
PDF
Rethinking Critical Editions of Fragments by Ontologies
PDF
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
PDF
An Approach to Automated Learning of Conceptual Graphs from Text
PPT
The role of linguistic information for shallow language processing
PDF
The Semantic Web #8 - Ontology
PDF
Centroid-based Text Summarization through Compositionality of Word Embeddings
PPTX
The Semantic Web #7 - RDF Semantics
ODP
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
textTOvec: Deep Contextualized Neural Autoregressive Topic Models of Language...
Pankaj Gupta CV / Resume
Rethinking Critical Editions of Fragments by Ontologies
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
An Approach to Automated Learning of Conceptual Graphs from Text
The role of linguistic information for shallow language processing
The Semantic Web #8 - Ontology
Centroid-based Text Summarization through Compositionality of Word Embeddings
The Semantic Web #7 - RDF Semantics
How to Ground A Language for Legal Discourse In a Prototypical Perceptual Sem...
Ad

Viewers also liked (8)

PDF
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
PDF
Recognising the Social Attitude in Natural Interaction with Pedagogical Agents
PDF
Recognising the Social Attitude in Natural Interaction with Pedagogical Agents
ODP
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
PPTX
Take your sbdc online
PDF
ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...
PDF
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
PDF
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
Recognising the Social Attitude in Natural Interaction with Pedagogical Agents
Recognising the Social Attitude in Natural Interaction with Pedagogical Agents
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
Take your sbdc online
ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned fro...
A Run Length Smoothing-Based Algorithm for Non-Manhattan Document Segmentation
A Domain Based Approach to Information Retrieval in Digital Libraries - Rotel...
Ad

Similar to ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text (20)

PDF
Timo Honkela: From Patterns of Movement to Subjectivity of Understanding
PDF
Disentangled Representation Learning in Speech and Vocalization
PDF
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
PDF
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
PDF
Timo Honkela: Multimodally Grounded Translation by Humans and Machines
PDF
Using the Framework of Networks to Enhance Learning and Social Interactions
PDF
Semiotics and conceptual modeling gv 2015
PDF
Timo Honkela: Subjects on objects in contexts: Using GICA method to quantify ...
PDF
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
PDF
CAL2011 Discourse-Centric Learning Analytics Briefing
PDF
Ai notes
PPTX
Natural lanaguage processing
PDF
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
PDF
Educational Futures Evidence Hub
PDF
Introduction to Natural language Processing
PDF
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
PPT
Dimensions of Media Object Comprehensibility
PDF
Listening comprehension in efl teaching
PDF
Listening Comprehension in EFL Teaching
PDF
Machine learning-and-data-mining-19-mining-text-and-web-data
Timo Honkela: From Patterns of Movement to Subjectivity of Understanding
Disentangled Representation Learning in Speech and Vocalization
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Cooperating Techniques for Extracting Conceptual Taxonomies from Text
Timo Honkela: Multimodally Grounded Translation by Humans and Machines
Using the Framework of Networks to Enhance Learning and Social Interactions
Semiotics and conceptual modeling gv 2015
Timo Honkela: Subjects on objects in contexts: Using GICA method to quantify ...
ONTOLOGICAL MODEL FOR CHARACTER RECOGNITION BASED ON SPATIAL RELATIONS
CAL2011 Discourse-Centric Learning Analytics Briefing
Ai notes
Natural lanaguage processing
XAI LANGUAGE TUTOR - A XAI-BASED LANGUAGE LEARNING CHATBOT USING ONTOLOGY AND...
Educational Futures Evidence Hub
Introduction to Natural language Processing
COMPREHENSIVE ANALYSIS OF NATURAL LANGUAGE PROCESSING TECHNIQUE
Dimensions of Media Object Comprehensibility
Listening comprehension in efl teaching
Listening Comprehension in EFL Teaching
Machine learning-and-data-mining-19-mining-text-and-web-data

ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text

  • 1. Università degli studi di Bari “Aldo Moro” Dipartimento di Informatica ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella L.A.C.A.M. {fabio.leuzzi, stefano.ferilli, fulvio.rotella}@uniba.it http://lacam.di.uniba.it 9th Italian Research Conference on Digital Libraries Università la Sapienza - Rome, Italy January 31 - February 1, 2013
  • 2. Overview ● Introduction & Objectives ● Tool overview ● Knowledge Representation Formalism ● Relevant concepts ● Information Retrieval ● Reasoning by Association ● Exploiting Tool ● Conclusions & Future Works ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 2
  • 3. Introduction Some repositories leave the responsibility of quality to the authors. + Anybody can produce and distribute documents. = Possible low average quality of the repository contents. The study, understanding and exploitation of the content of a digital library, with the aim to easily explore the semantic content of huge amounts of text. ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 3
  • 4. Introduction Possible solution: ● Natural Language Processing systems ● Provide the grammatical structures contained in text ● Knowledge Representation formalisms ● Semantic networks ● Graph learning techniques ● To obtain a semantic network starting from the text ● In order to satisfy the information needs, the knowledge base can be exploited: ● To make summarizations ● To reason with it ● ... ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 4
  • 5. Objectives Improving fruition of a DL ● Use of a tool providing advanced functionalities ● Mixed strategy for relevant concept recognition ● Semantic approach to information retrieval ● Automatic inference over the acquired knowledge ● Reasoning by association ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 5
  • 6. Tool overview ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 6
  • 7. Knowledge representation formalism Only subject, verb and complement have been considered. ● Subjects and complements → concepts ● Verbs → relations between them subject, subject, verb, complement complement The frequency of arcs between the concepts in positive and negative sentences has been taken into account. ● Enrich the representation formalism ● Give robustness to our solution through a statistical approach ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 7
  • 8. Relevant Concepts ● Relevant nodes are sought in the graph ● Mixed strategy ● Semantic network structure ● EM clustering provided by Weka ● Keyword Extraction ● Quantitative approach based on co-occurrences ● Qualitative approach exploiting WordNet ● Psychological approach based on principles of an effective presentation ● Components empirically weighted ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 8
  • 9. Information Retrieval ● Word Sense Disambiguation ● One Domain per Discourse assumption: many uses of a word in a coherent portion of text tend to share the same domain ● Prevalent domain individuation ● Extraction of all synsets for each term ● Extraction of all domains for each synset ● Choice of prevalent domain synset ● Pairwise Complete Link Agglomerative Clustering ● Each synset generates a singleton cluster ● For each pair of clusters ● If the complete link property holds ● Merge the involved clusters ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 9
  • 10. Information Retrieval ● Multi-strategy Similarity Measure on WordNet ● 3 components summed and normalized in ]0,1[ ● depth (ancestors) ● breadth (direct neighbors) ● breadth (inverse neighbors) ● Document Partitioning ● For each document ● Each synset votes for a cluster ● User Query Processing ● Brute force WSD to find the best synsets combination ● Best combination used to return a ranked list of clusters ● Each cluster has a list of related documents obtained by the Document Partitioning phase ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 10
  • 11. Reasoning ‘by association’ Breadth-First Search Given two nodes (concepts), a Breadth-First Search starts from both nodes, the former searches the latter's frontier and vice versa, until the two frontiers meet. Then the path is restored going backward to the roots in both directions. We also provide the number of positive/negative instances, and the corresponding ratios over the total to help understanding different gradations (permitted, prohibited, typical, rare, etc.) of actions between two objects. ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 11
  • 12. Reasoning ‘by association’ Breadth-First Search The table below shows a sample of possible outcomes. E.g., an interpretation of case 1 can be: “the young looks television that talks about (and criticizes) facebook, because it typically does not help (rather distracts) schoolwork”. ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 12
  • 13. Reasoning ‘by association’ Probabilistic approach Real world data are typically noisy and uncertain → need for strategies that soften the classical rigid logical reasoning Defined a formalism based on ProbLog language: pi :: fi ● fi : ground literal of the form link (subject, verb, complement) ● pi : ratio between the sum of all examples for which fi holds and the sum of all possible links between subject and complement ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 13
  • 14. ConNeKTion ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 14
  • 15. ConNeKTion ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 15
  • 16. ConNeKTion ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 16
  • 17. ConNeKTion ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 17
  • 18. ConNeKTion ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 18
  • 19. Conclusions ConNeKTion allows to learn conceptual graphs from plain text and to enrich them by finding concept generalizations. The resulting graph can be used for several purposes: ● finding relationships between concepts (if any) ● filtering the concepts from a particular perspective ● relevant concepts recognition and information retrieval A suitable control panel is provided for the user to comfortably carry out these activities. ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 19
  • 20. Future Works We plan to improve the natural language text pre-processing using anaphora resolution in order to replace, where possible, pronouns with the explicit concept they express. All functionalities have parameters set empirically. A criteria for automatical setting of suitable parameters is needed. The preseted functionalities are based on the exploitation of WordNet. A strategy to make the operators WordNet free can be desirable. We also wish to extend the reasoning operators by adding an argumentation operator, that could exploit probabilistic weights, intended as a rate of reliability, to provide support or attack to a given statement. ConNeKTion: A Tool for Exploiting Conceptual Graphs Automatically Learned from Text F. Leuzzi, S. Ferilli, F. Rotella 20