1887

Abstract

Abstract

In order to improve the browsing activity in a documentary database, we propose a conceptual approach for multi-level restructuring of categorized documents in a corpus. Starting from a manual and static organized corpus, based on the domain ontology, we derive new dynamically generated structures embedded in the static one. We use a conceptual recursive indexing method based on the selection of the minimal number of concepts covering either a document or a subset of documents corresponding to a sub-corpus. Hence, our system provides an additional browsing feature to the user, by dynamically providing the system with a conceptual structure of clusters of documents. For illustration, you may find in the figure an application to Arabic financial news for a particular ontology.

Therefore, one finds sub-category under the category . Also, under the category, etc. In parallel with the classical browser system, indexing words, provided for each level, give the user more details about the file's content, as well as the category content, before further exploration. Our approach improves human-computer interaction by decreasing the browsing time. Assessment of the proposed method proves that combining manual documents categorizations, with the automatic feature generations, gives a flexible and effective structured browsing interface to the users. Finally, low-level features help for incrementally placing new documents in the right category, by using suitable supervised classification methods.

Loading

Article metrics loading...

/content/papers/10.5339/qfarf.2010.CSP2
2010-12-13
2024-03-28
Loading full text...

Full text loading...

References

  1. S. Elloumi, A. Al Jaoua, F. Ferjani, M.J. Jaam, F. Laban, H. Hammami, N. Semar, Conceptual approach for multi-level restructuring of categorized documents in a corpus, QFARF Proceedings, 2010, CSP2.
    [Google Scholar]
http://instance.metastore.ingenta.com/content/papers/10.5339/qfarf.2010.CSP2
Loading
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error