Skip to main content

Brainspace

Field Definitions

Below you will find the definitions for the various fields used within Brainspace; this includes both internal fields and Entity Extraction fields.

Internal Fields

brs_domain - The domain portion of email addresses detected brs_dup_type - The duplicate type this document was classified as -- when the cluster is Normal     all documents -> Unique -- when the cluster is ExactDup      pivot document -> ExactOrig      other documents -> ExactDup -- when the cluster is NearDup      pivot document -> NearOrig      other documents -> NearDup -- when the cluster is ExactDup AND (isPivotDocument = true)      pivot document -> ExactOrigNearOrig      other documents -> ExactDup -- when the cluster is ExactDup AND (isPivotDocument = false)      pivot document -> ExactOrigNearDup      other documents -> ExactDup -- when the cluster is Excluded      all documents -> Unique brs_emt_action - Type of message: Send, Reply, ReplyAll, Forward, or Draft brs_emt_duplicateSpecifies if this document is a duplicate: True/False brs_emt_is_message - Specifies if this document is an email: True/False brs_emt_has_missing_message - Email thread has been identified to have a message that i0s not present in the dataset brs_emt_unique - Message is unique, due to the message itself and/or an attachment. (brs_emt_unique_reason details why it is unique.) brs_emt_unique_message - Specifies if this email is the most inclusive of a branch of an email thread brs_emt_unique_reason - Specifies why the email was flagged as unique: Message, Attach, or Message_and_Attach brs_emt_was_unique_attach - Specifies if a document was unique in a previous build due to attachment, but after incremental build is no longer unique: True/False brs_emt_was_unique_message - Specifies if a message was unique in a previous build, but after incremental build is no longer unique: True/False brs_emt_was_updated - Specifies if this message in a thread was updated after incremental build: True/False brs_from_domain - The domain potion of the email addresses detected as having sent the email brs_language_segment - A unique primary language for an identified segment. (If the same language is detected for multiple segments it will only be listed once) brs_primary_language - The single most likely and prominent language in a document (according to the language detector) brs_recipient - Email addresses detected as having received the email brs_recipient_domain - The domain potion of email addresses detected as having received the email

Entity Extraction Fields

Entity_Credit_Card_Num - A sequence of numbers that are detected as patterns used by the major credit card providers Entity_Email - A sequence of characters that appear to be legitimate email addresses Entity_Location - A city, state, country, region, or other location that contains both a population and a government. - A geographic place such as a body of water, mountain, park, or address. - A structure such as a building or monument. Entity_Money - A sequence of symbols, numbers and/or words that are detected as referencing money Entity_Nationality - Reference to a country or region of origin, such as American or Swiss. Entity_Organization - A corporation, institution, government agency, or other group of people defined by an established organizational structure. Entity_Person - A human identified by name, nickname, or alias. Entity_Personal_ID_Num - A series of digits in a pattern detected as a national or well known personal identifier Entity_Phone_Number - A series of digits in a pattern detected as a phone number Entity_Product - Reference to a commercially available product Entity_Religion - Reference to an organized religion or theology as well as its followers. Entity_Title - Appellation associated with a person by virtue of occupation, office, birth, or as an honorific. Entity_Url - Web address