Texts/Textes in areas (2024-03-15)
Jérôme Euzenat, L'annotation formelle de documents en (8) questions, in: Régine Teulier, Jean Charlet, Pierre Tchounikine (éds), Ingénierie des connaissances, L'Harmattan, Paris (FR), 2005, pp251-271
Annoter un ensemble de documents informels à l'aide de représentations formelles appelle plusieurs questions qui doivent trouver une réponse si l'on veut développer un système cohérent. Ces questions sont liées à la forme et à l'objet des représentations retenues, à la nécessité d'utiliser de la connaissance indépendante du contenu des documents (ontologies, connaissance de contexte) et au statut du système résultant (grande base de connaissance ou éléments de connaissance distribués). Ces questions sont décrites et illustrées par l'annotation de résumés d'articles en génétique moléculaire.
Web sémantique, recherche de documents par le contenu, annotation formelle, représentation du contenu, ontologie, connaissance de contexte
Jérôme Euzenat, Eight questions about semantic web annotations, IEEE Intelligent systems 17(2):55-62, 2002
Improving information retrieval is annotation¹s central goal. However, without sufficient planning, annotation - especially when running a robot and attaching automatically extracted content - risks producing incoherent information. The author recommends answering eight questions before you annotate. He provides a practical application of this approach, and discusses applying the questions to other systems.
Semantic web, Search by content, Content representation, Ontology, Background knowledge
Farid Cerbah, Jérôme Euzenat, Traceability between models and texts through terminology, Data and knowledge engineering 38(1):31-43, 2001
Modeling often concerns the translation of informal texts into representations. This translation process requires support for itself and for its traceability. We pretend that inserting a terminology between informal textual documents and their formalization can help to serve both of these goals. Modern terminology extraction tools support the formalization process by using terms as a first sketch of formalized concepts. Moreover, the terms can be employed for linking the concepts and the textual sources. They act as a powerful navigation structure. This is exemplified through the presentation of a fully implemented system.
Terminology extraction, Traceability, Model generation, Hypertext, Object-oriented modeling, Natural language
Jérôme Euzenat, L'annotation formelle de documents en huit (8) questions, in: Jean Charlet (éd), Actes 6e journées sur ingénierie des connaissances (IC), Grenoble (FR), pp95-110, 2001
Annoter un ensemble de documents informels à l'aide de représentations formelles appelle plusieurs questions qui doivent trouver une réponse si l'on veut développer un système cohérent. Ces questions sont liées à la forme et à l'objet des représentations retenues, à la nécessité d'utiliser de la connaissance indépendante du contenu des documents (ontologies, connaissance de contexte) et au statut du système résultant (grande base de connaissance ou éléments de connaissance distribués). Ces questions sont décrites et illustrées par la tentative d'annotation de résumés d'articles en génétique moléculaire.
Web sémantique, recherche par le contenu, annotation formelle, représentation du contenu, ontologie, connaissance de contexte
Farid Cerbah, Jérôme Euzenat, Using terminology extraction techniques for improving traceability from formal models to textual requirements, in: Proc. 5th international conference on applications of natural language to information systems (NLDB), Versailles (FR), (Mokrane Bouzeghoub, Zoubida Kedad, Élisabeth Métais (eds), Natural Language Processing and Information Systems, Lecture notes in computer science 1959, 2001), pp115-126, 2000
This article deals with traceability in sotfware engineering. More precisely, we concentrate on the role of terminological knowledge the mapping between (informal) textual requirements and (formal) object models. We show that terminological knowledge facilitates production of traceability links, provided that language processing technologies allow to elaborate semi-automatically the required terminological resources. The presented system is one step towards incremental formalization from textual knowledge.
XML, terminology, knowledge extraction
Farid Cerbah, Jérôme Euzenat, Integrating textual knowledge and formal knowledge for improving traceability, in: Proc. ECAI workshop on Knowledge Management and Organizational Memory, Berlin (DE), pp10-16, 2000
This article deals with traceability in knowledge repositories. More precisely, we concentrate on the role of terminological knowledge in the mapping between (informal) textual requirements and (formal) object models. We show that terminological knowledge facilitates the production of traceability links, provided that language processing technologies allow to elaborate semi-automatically the required terminological resources. The presented system is one step towards incremental formalization from textual knowledge. As such, it is a valuable tool for building knowledge repositories.
XML, terminology, knowledge extraction
Farid Cerbah, Jérôme Euzenat, Integrating textual knowledge and formal knowledge for improving traceability, in: Proc. 12th international conference on knowledge engineering and knowledge management (EKAW), Juan-les-Pins (FR), (Rose Dieng, Olivier Corby (eds), Knowledge engineering and knowledge management: methods, models and tools, Lecture notes in computer science 1937, 2000), pp296-303, 2000
Knowledge engineering often concerns the translation of informal knowledge into a formal representation. This translation process requires support for itself and for its We pretend that inserting a terminological structure between informal textual documents and their formalization serves both of these goals. Modern terminology extraction tools support the process where the terms are a first sketch of formalized concepts. Moreover, the terms can be used for linking the concepts and the pieces of texts. This is exemplified through the presentation of an implemented system.
XML, terminology, knowledge extraction
Jérôme Euzenat, Vers une plate-forme de diffusion de textes sur internet : étude préliminaire, Rapport de conseil, 63p., juin 2000
Farid Cerbah, Jérôme Euzenat, Intégration de connaissances modélisées et de connaissances textuelles: spécification d'un système d'aide à la pose de liens de traçabilité, Deliverable DGT 7672, Dassault aviation, Saint-Cloud (FR), 15p., avril 1999
Jérôme Euzenat, Intégration de connaissances modélisées et de connaissances textuelles : intégration objets-termes-textes via XML, Deliverable, Dassault aviation, Saint-Cloud (FR), 16p., septembre 1999
Jérôme Euzenat, Corporate memory through cooperative creation of knowledge bases and hyper-documents, in: Proc. 10th workshop on knowledge acquisition (KAW), Banff (CA), pp(36)1-18, 1996
Best paper of the corporate memory and enterprise modelling track
The Co4 system is dedicated to the representation of formal knowledge in an object and task based manner. It is fully interleaved with hyper-documents and thus provides integration of formal and informal knowledge. Moreover, consensus about the content of the knowledge bases is enforced with the help of a protocol for integrating knowledge through several levels of consensual knowledge bases. Co4 is presented here as addressing three claims about corporate memory: (1) it must be formalised to the greatest possible extent so that its semantics is clear and its manipulation can be automated; (2) it cannot be totally formalised and thus formal and informal knowledge must be organised such that they refer to each other; (3) in order to be useful, it must be accepted by the people involved (providers and users) and thus must be non contradictory and consensual.
Jérôme Euzenat, Building consensual knowledge bases: context and architecture, in: Nicolaas Mars (ed), Towards very large knowledge bases, IOS press, Amsterdam (NL), 1995, pp143-155
A protocol and architecture are presented in order to achieve consensual knowledge bases (i.e. bases in which knowledge is expressed in a formal language and which are considered as containing the state of the art in some research area). It assumes that the construction of the base must and can be achieved collectively. The architecture is based on individual workstations which provide support for developing a knowledge base: formal expression of knowledge through objects, tasks and qualitative equations annotated with hypertext nodes and links. It also provides tools for detecting similarities and inconsistencies between pieces of knowledge. These bases can be grouped together in order to constitute a new reference knowledge base. The process for constructing this last base mimics the submission of articles to peer-reviewed journals. This is achieved through a protocol for submitting knowledge to the group base, confronting it with the content of that base, amending it accordingly, reviewing it by the other knowledge bases and finally incorporating it. The system is to be used by researchers in the field of genome sequencing.
CSCW, knowledge sharing, knowledge revision, negotiation, protocol, knowledge communication