Document representation
A document representation usually means a bibliographic record that represents a document.
Sometimes the term is used in a narrower sense to mean subject representation (as opposed to document description).
In a full-text environment (including full-text Internet documents), the representation of a document may be identical to the document being represented, or it may consist of the document together with added metadata. An important theoretical question in information science is whether a document is the optimal representation of itself, or whether indexing and other kinds of metadata provided by information professionals can improve the findability of documents.
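The distinction between document description, subject representation and full text can be illustrated with a small data structure. The following is only a minimal sketch: the class `DocumentRepresentation`, its field names, the helper `matches` and the sample record are all hypothetical and are not taken from any cataloguing standard or retrieval system. It shows how an indexer-assigned subject term may make a document findable under a word that its full text never uses.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class DocumentRepresentation:
    """Hypothetical record combining description, subject terms and full text."""
    # Document description: fields that identify the document.
    title: str
    author: str
    year: int
    # Subject representation: terms assigned by an indexer (assumed field).
    subject_terms: List[str] = field(default_factory=list)
    # In a full-text environment the document itself may be included.
    full_text: Optional[str] = None

    def searchable_terms(self) -> Set[str]:
        """Collect lowercase words from title, subject terms and any full text."""
        text = " ".join([self.title, *self.subject_terms, self.full_text or ""])
        words = (word.strip(".,;:()").lower() for word in text.split())
        return {word for word in words if word}


def matches(rep: DocumentRepresentation, query: str) -> bool:
    """Naive exact-word match of a one-word query against the representation."""
    return query.lower() in rep.searchable_terms()


# Invented sample data: the same document with and without added metadata.
text = "Cats and dogs are kept as pets."
full_text_only = DocumentRepresentation("Pets at home", "A. Author", 2000,
                                        full_text=text)
indexed = DocumentRepresentation("Pets at home", "A. Author", 2000,
                                 subject_terms=["felines"], full_text=text)

print(matches(full_text_only, "felines"))
print(matches(indexed, "felines"))
```

In this toy case the query "felines" fails against the full text alone but succeeds once the subject term has been added. Whether such added metadata improves findability in practice is the open theoretical question raised above; the sketch does not settle it.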
Literature:
Barry, C. L. (1998). Document representations and clues to document relevance. Journal of the American Society for Information Science, 49(14), 1293-1303.
Croft, W. B. (1981). Document representation in probabilistic models of information-retrieval. Journal of the American Society for Information Science, 32(6), 451-457.
Dolin, R. H.; Alschuler, L.; Boyer, S. & Beebe, C. (2000). An update on HL7's XML-based document representation standards. Journal of the American Medical Informatics Association, S, 190-194. Available: http://www.amia.org/pubs/symposia/D200113.PDF
Janes, J. W. (1991). Relevance judgments and the incremental presentation of document representations. Information Processing & Management, 27(6), 629-646.
Kwok, K. L. (1975). Use of title and cited titles as document representation for automatic classification. Information Processing & Management, 11(8-12), 201-206.
Kwok, K. L. (1988). On the use of bibliographically related titles for the enhancement of document representations. Information Processing & Management, 24(2), 123-131.
Lalmas, M. (2000). Combining document representations. International Journal of Cooperative Information Systems, 9(4), 427-447.
Marega, R. & Pazienza, M. T. (1994). CODHIR - An information-retrieval system based on semantic document representation. Journal of Information Science, 20(6), 399-412.
Oh, S. G. (1998). Document representation and retrieval using empirical facts: Evaluation of a pilot system. Journal of the American Society for Information Science, 49(10), 920-931.
Paijmans, H. (1993). Comparing the document representations of 2 IR-systems - CLARIT and TOPIC. Journal of the American Society for Information Science, 44(7), 383-392.
Srinivasan, P. (1990). A comparison of 2-Poisson, inverse document frequency and discrimination value models of document representation. Information Processing & Management, 26(2), 269-278.
See also: Partial text representation
Birger Hjørland
Last edited: 30-04-2006