File

  • Document is a generated schema and shouldn’t be created directly by users.

Relations to other schemata

Document has the following relations to other schemata in the model:

Relation Schemata
Properties inherited from: Analyzable, Thing
More precise schemata: HyperText, Email, Pages, Package, Message, PlainText, Article, Image, Audio, Video, Table, Folder, Workbook
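
The relations above can be sketched as a small validation step. This is a minimal illustration in plain Python, not the FtM library API: the helper name `check_document_schema` is hypothetical, and the schema names are copied from the relations listed above.

```python
# Schema names copied from the relations listed above.
DOCUMENT_PARENTS = {"Analyzable", "Thing"}
DOCUMENT_DESCENDANTS = {
    "HyperText", "Email", "Pages", "Package", "Message", "PlainText",
    "Article", "Image", "Audio", "Video", "Table", "Folder", "Workbook",
}


def check_document_schema(schema: str) -> str:
    """Hypothetical helper: accept only a more precise schema than Document."""
    if schema == "Document":
        # Document itself should not be created directly by users.
        raise ValueError("Document should not be created directly")
    if schema not in DOCUMENT_DESCENDANTS:
        raise ValueError(f"{schema!r} is not a more precise schema of Document")
    return schema
```

For example, `check_document_schema("PlainText")` returns the name unchanged, while `check_document_schema("Document")` raises a `ValueError`.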

Document (labelled "File") has multiple properties to describe its data. Some properties are inherited from its parent schemata, Analyzable and Thing.

Icon legend

This property is used to compute a caption for the entity.

This property is a featured property, which means in UI applications it should be shown with higher information hierarchy.

This property is a matchable property and can be used in matching systems to find similar entities.

This property is a hidden property and should not be rendered in UI applications.

Name Label Type
Document:fileName File name string
Document:title Title string
Document:mimeType MIME type mimetype
Document:parent Folder entity
Document:contentHash Checksum (SHA1 hash of the data) checksum
Document:author Author (The original author, not the uploader) string
Document:generator Generator (The program used to generate this file) string
Document:crawler Crawler (The crawler used to acquire this file) string
Document:fileSize File size number
Document:extension File extension string
Document:encoding File encoding string
Document:bodyText Text text
Document:messageId Message ID (Message ID of a document; unique in most cases) string
Document:language Language language
Document:translatedLanguage The language of the translated text language
Document:translatedText Translated version of the body text text
Document:date Date (If not otherwise specified) date
Document:authoredAt Authored on date
Document:publishedAt Published on date
Document:ancestors Ancestors entity
Document:processingStatus Processing status string
Document:processingError Processing error string
Document:processingAgent Processing agent (Name and version of the processing agent used to process the Document) string
Document:processedAt Processed at (Date and time of the most recent ingestion of the Document) date
Document:proven Derived entities entity
Document:provenIntervals Derived relationships entity
Thing:name Name name
Thing:summary Summary text
Thing:description Description text
Thing:country Country country
Thing:alias Other name name
Thing:previousName Previous name name
Thing:weakAlias Weak alias name
Thing:sourceUrl Source link url
Thing:publisher Publishing source string
Thing:publisherUrl Publishing source URL url
Thing:alephUrl Aleph URL url
Thing:wikipediaUrl Wikipedia Article url
Thing:wikidataId Wikidata ID identifier
Thing:keywords Keywords string
Thing:topics Topics topic
Thing:address Address address
Thing:addressEntity Address entity
Thing:program Program (Regulatory program or sanctions list on which an entity is listed) string
Thing:programId Program ID identifier
Thing:notes Notes text
Thing:proof Source document entity
Thing:indexText Index text text
Thing:createdAt Created at date
Thing:modifiedAt Modified on date
Thing:retrievedAt Retrieved on date
Thing:courtCase Court cases entity
Analyzable:detectedLanguage Detected language language
Analyzable:detectedCountry Detected country country
Analyzable:namesMentioned Detected names name
Analyzable:peopleMentioned Detected people name
Analyzable:companiesMentioned Detected companies name
Analyzable:ibanMentioned Detected IBANs identifier
Analyzable:ipMentioned Detected IP addresses ip
Analyzable:locationMentioned Detected locations address
Analyzable:phoneMentioned Detected phones phone
Analyzable:emailMentioned Detected e-mail addresses email
Thing:unknownLinkTo Linked to entity
Thing:unknownLinkFrom Linked from entity
Document:mentionedEntities Extracted names entity
Thing:candidateSimilars Similar to this entity
Thing:matchSimilars Similar as this entity
Thing:noteEntities Notes entity
Thing:sanctions Sanctions entity
Document:relatedEntities Related entities entity
Thing:documentedBy Documents entity
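
As an illustration of how a few of the properties above fit together, the following sketch builds a file entity as a plain Python dictionary. It assumes the commonly used FtM JSON layout (an `id`, a `schema`, and `properties` mapping each property name to a list of string values). The helper `make_file_entity` and the entity ID are hypothetical. The checksum is computed with SHA1, as described for `Document:contentHash`.

```python
import hashlib
import json


def make_file_entity(entity_id, schema, file_name, data, mime_type):
    """Hypothetical helper: describe a file using Document properties."""
    # Document:contentHash is documented as the SHA1 hash of the data.
    content_hash = hashlib.sha1(data).hexdigest()
    return {
        "id": entity_id,
        # Use a more precise schema, since Document is not created directly.
        "schema": schema,
        # Assumption: every property holds a list of string values.
        "properties": {
            "fileName": [file_name],
            "mimeType": [mime_type],
            "fileSize": [str(len(data))],
            "contentHash": [content_hash],
        },
    }


entity = make_file_entity(
    "example-id-1", "PlainText", "notes.txt", b"hello world", "text/plain"
)
print(json.dumps(entity, indent=2))
```

Inherited properties such as `Thing:name` or `Analyzable:detectedLanguage` would be added to the same `properties` mapping under their short names.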

How to use it in a graph or timeline

FtM has well-defined semantics for different representations of entities, for example in a network graph or in a timeline.

When using this schema in a graph representation, it should be represented as a node.

Semantic Property
Temporal start Document:date, Document:authoredAt, Document:publishedAt
Temporal end -
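
One way to read the temporal semantics above is sketched below: collect the values of the three temporal start properties and pick one of them as the start of the entity on a timeline. Choosing the earliest value is an assumption made for this example, as is the helper name `temporal_start`. The sketch relies on ISO 8601 date strings sorting chronologically when compared as text.

```python
# Property names taken from the "Temporal start" row above.
TEMPORAL_START_PROPS = ("date", "authoredAt", "publishedAt")


def temporal_start(properties):
    """Hypothetical helper: return the earliest temporal start value, or None."""
    candidates = [
        value
        for name in TEMPORAL_START_PROPS
        for value in properties.get(name, [])
    ]
    # Assumption: the earliest ISO 8601 string marks the start on a timeline.
    return min(candidates) if candidates else None
```

Because the schema defines no temporal end, a timeline built this way would show each document as a single point in time, not as an interval.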