RCAV Model:

Advanced Management of

Unstructured Content

  • Issue: Inventory Management
    Problem: Without a comprehensive inventory, enterprises struggle to understand the scope and location of their data.
    Solution: The Rationalize step addresses this by inventorying all data, categorizing it according to the WHERE (location) dimension—Country, State/Province/District, and ZIP. This first step lays the groundwork for more detailed analysis in subsequent steps.

  • Issue: Data Utilization
    Problem: Unstructured content remains underutilized, obscuring potential insights and value.
    Solution: Classification organizes content into the WHAT dimension—Genre, Type, and Format—transforming raw data into structured, manageable categories. This facilitates efficient data retrieval and governance across all business units.

  • Issue: Data Precision
    Problem: Crucial information within documents often remains hidden because it is not systematically identified and extracted.
    Solution: Attribute Extraction delves deeper into the WHO (People and Entities involved) and WHEN (Time-related data) dimensions, pulling out critical data elements like names, dates, times, and roles. This step ensures that each piece of data is accurately attributed for use in compliance, reporting, and decision-making.

  • Issue: Data Integrity
    Problem: There is a risk of basing decisions on incomplete or inaccurate data without proper validation.
    Solution: Validation ensures the integrity of data across all dimensions—WHAT, WHERE, WHEN, WHO—confirming that classifications and attributions are correct and complete, thus ensuring reliable data for business operations.

3DI Framework

The 3DI framework is based on a structured approach to data classification and analysis using four key dimensions:  WHATWHEREWHEN, and WHO. Each dimension is broken down into a 3-tier model to provide detailed and actionable metadata for documents or records. Here's an overview of the 3-tier framework for each dimension:

    • Tier 1: Genre – Defines the broad category of the document, such as "Agreements."

    • Tier 2: Type – Specifies the exact type of document within that genre, like "Agreement-NDA."

    • Tier 3: Format – Identifies the file format, such as "MSWord," "PDF," or "TIFF."

    • Tier 1: Country – Detects the country of origin, like "US" or "GB."

    • Tier 2: State/Province/District – Pinpoints the specific state or province, e.g., "California" or "TX."

    • Tier 3: ZIP – Narrows it down to ZIP code level, for example, "90210" or the local equivalent.

    • Tier 1: Year – Identifies the specific year, e.g., "2024."

    • Tier 2: Month/Day – Breaks down the date further, such as "10-16."

    • Tier 3: Time – Provides the exact time in "HH:MM

      " format, e.g., "14:45:30 UTC."

    • Tier 1: Entity – Refers to the organization or corporation, such as "FedEx" or "US Army."

    • Tier 2: Person – Specifies the individual or custodian responsible, e.g., "John Martin."

    • Tier 3: Referred – Lists other people mentioned or involved in the document, e.g., in an email's "TO," "CC," or "Author" fields.

Get Started