Skip to content

Glossary

​Asynchronous Task-Based Architecture

An architectural approach where tasks are processed asynchronously, using task queues and workers, to improve efficiency and scalability.

​Audit Logging

The process of recording all system activities and user actions to ensure traceability and accountability.

​Azure Active Directory (Azure AD)

A cloud-based identity and access management service that provides secure authentication and authorization.

​Azure AI Services

A suite of tools provided by Azure for building, deploying, and managing machine learning models and AI applications.

​Azure Blob Storage

A scalable and secure cloud storage service provided by Azure, used for storing large amounts of unstructured data.

​Azure Functions

A serverless compute service by Azure that allows code to run in response to events without managing the underlying infrastructure.

​Azure Key Vault

A service that securely stores and manages sensitive information such as API keys, passwords, and certificates.

​Azure Text Analytics for PII

A service within Azure that uses natural language processing to identify and redact PII from text documents.

​Backup and Recovery

Procedures to ensure data integrity and availability, including regular data backups and disaster recovery plans.

​Celery

An asynchronous task queue/job queue system used to manage and execute tasks in the background.

​Celery Worker

A background process that picks up and executes tasks from a queue managed by Celery.

​Customer Satisfaction

Measuring user satisfaction through surveys and feedback to gauge the system's performance and ease of use.

​End-to-End (E2E) Testing

A testing methodology that verifies the functionality of the entire application from start to finish, ensuring all components work together as expected.

​FastAPI

A high-performance framework for building APIs with Python, known for its speed and efficiency in handling requests.

​Fault Tolerance

The system's ability to continue operating properly in the event of the failure of some of its components.

​Feature Completeness

Implementing all necessary features as outlined in the functional requirements, including secure file transfers, automated metadata generation, and robust logging mechanisms.

​General Data Protection Regulation (GDPR)

A regulation in EU law on data protection and privacy, ensuring the protection of personal data for individuals within the European Union.

​Human-in-the-Loop (HITL)

An approach where human input is used to improve and validate machine learning models, ensuring higher accuracy and reliability.

​HTTPS Protocol

Hypertext Transfer Protocol Secure, an extension of HTTP that uses encryption (TLS/SSL) to secure data transmitted over the internet.

​Integration

Successful integration with external systems like Relativity One, including retrieving file content, updating metadata tags, and listing documents and workspaces.

​Maintainability

Ensuring the system is designed with modularity and clear separation of concerns to facilitate easy maintenance and future updates.

​Metadata Accuracy

Ensuring that metadata tags are accurately generated and reflect the correct PII status of each document.

​Metadata Tags

Information that describes other data, in this context, tags added to documents to indicate the presence of PII.

​Minimal Viable Integration

Implementing the minimal necessary features to prove the concept, including retrieving file content from Relativity One, processing files for PII, and updating metadata tags on the documents.

​Modular Design

An architectural design principle where the system is divided into smaller, self-contained modules, each responsible for a specific functionality, promoting maintainability and scalability.

​Nuxt.js

A framework for building server-side rendered applications with Vue.js, enhancing the development experience with features like routing and automatic code-splitting.

​Performance

Ensuring the system handles data transfers and processing efficiently, with minimal latency and high throughput.

​PII (Personally Identifiable Information)

Information that can be used to identify an individual, such as names, addresses, social security numbers, etc.

​PII Detection

The process of identifying PII within documents, aiming for high accuracy and minimal false positives/negatives.

​Plain Text Assumption

The system assumes that the file content retrieved via the REST API will be in plain text, which may limit the types of documents that can be processed.

​PostgreSQL

A powerful, open-source object-relational database system that provides robust data storage and retrieval capabilities.

​Precision

A measure of accuracy, indicating the percentage of correctly identified instances out of all identified instances.

​Presentation Layer

The layer that handles user interaction and presentation logic, typically involving the user interface.

​Proof of Concept (POC)

An early demonstration to validate the feasibility of a concept or idea in a real-world scenario. It is used to prove that the system can meet its intended goals.

​Redis

An in-memory data structure store used as a database, cache, and message broker to improve performance by storing frequently used data temporarily.

​Relativity One

A well-known document platform used for legal and compliance purposes, which provides tools for managing and analyzing large volumes of data.

​Reliability

Ensuring the system demonstrates stable performance with minimal downtime and includes robust error handling and fault tolerance mechanisms.

​REST API

Representational State Transfer Application Programming Interface, a set of rules and conventions for building and interacting with web services, allowing different systems to communicate over HTTP.

​Response Times

The time it takes for the system to respond to user interactions or complete processing tasks, aiming for minimal latency.

​Role-Based Access Control (RBAC)

A method of regulating access to resources based on the roles of individual users within an organisation.

​Scalability

The ability of the system to handle increasing data volumes and dynamic scaling of resources.

​Security

Implementing robust measures to protect data and ensure that only authorized users can access and manage the system.

​Single Sign-On (SSO)

An authentication process that allows a user to access multiple applications with one set of login credentials.

​Task Queuing by Using RabbitMQ

The process of placing tasks in a queue for asynchronous execution by Celery workers.

​Tailwind CSS

A utility-first CSS framework that provides pre-built classes for rapid development of responsive user interfaces.

​Usability

Ensuring the system has an intuitive and user-friendly interface, providing clear instructions and help documentation for users.

​User Acceptance Testing (UAT)

A phase of software testing where the end users test the system to ensure it meets their needs and requirements.

​User Interface (UI)

The part of the application that users interact with, designed to be user-friendly and intuitive.

​User Technical Proficiency

The assumption that users have the necessary skills and knowledge to use the system effectively.

​User-Friendly Interface

A user interface designed to be easy to use, intuitive, and accessible, ensuring a positive user experience.