The document presents an automatic annotation method for data units in search results returned by web databases, ensuring that these units are semantically labeled for machine processing. The approach involves aligning data units into groups with the same semantic meaning, annotating them via multiple annotators, and creating a reusable annotation wrapper for efficient future use. Experiments indicate the proposed technique is effective and enhances the scalability of extracting meaningful information from various data sources.
Related topics: