Diskover Data Management Platform

Core Features

Take Control of ALL Your Data with Diskover

Diskover’s Core Features

The Diskover Data Management Platform is like a Swiss Army knife, offering various integrated tools as well as smart plugins. At the heart of it all, Diskover “connects” all your data storage and file systems, giving you global visibility via your favorite web browser.

Industry Specific Editions

While this page summarizes Diskover’s core features, additional tools are available with these industry-specific editions:

➡ Media & Entertainment Edition

➡ Life Science Edition

Introduction to Diskover Data Management Platform | This video talks about the current data growth and associated risk issues and then introduces Diskover’s integrated searches, analytics, workflows, and file actions as a sustainable data management solution.

Your first step towards taking control of ALL your data.

Get hands-on experience with the Diskover platform with minimal to no deployment effort by using the Diskover Community Edition in the AWS Marketplace. The Diskover Community Edition is free to use for an unlimited time; however, applicable AWS EC2 instance resource charges apply.

Integrated Core Features Overview


Integrated SEARCHES

  • Global view via any web browser: a single query searches all the data repositories connected to Diskover.
  • Built-in search tools can be combined with manual queries for granular searches.
  • Extra search options via business context metadata.
  • Export and share your results in one click.

Integrated ANALYTICS

  • Multiple standard and customizable reports for informed data management decision-making.
  • Customizable storage cost reporting.
  • Heatmap report compares data from two different points in time to monitor growth, shrinkage, and smooth transitions/backups.

Integrated ACTIONS

  • Multiple integrated core and industry-specific plugins.
  • File actions for one-click access to third-party platforms, live view of directories, data movement, etc.
  • Customers can deploy their own plugins for custom in-house workflows.

Integrated WORKFLOWS

  • Automated data curation based on rules and rich metadata.
  • Multiple plugins allowing for data curation, movement, integrity, actions, etc.
  • Automated and manual tagging to facilitate data curation.
Graphic of a closed fist at the end of a lightning bolt, representing that Diskover Data empowers all stakeholders and allows all line-of-business users to have their own relationship with data. The file search tools and analytical reports allow for in-depth data analysis and informed decision-making.

Flexible, Fast, SCALABLE

  • Can index massive amounts of data at blazing speed, no matter where the data is located.
  • Non-proprietary, which helps ensure the safety of your source digital assets.
  • Extra metadata harvesting adds business context to all aspects of data management.

Sustainable Solution BENEFITS

  • Sustainable data management solution allowing you to reuse your data storage space and curate your data.
  • Empowers all stakeholders, increases productivity, and reduces human errors.
  • Reduces data-related operating costs.

Platform OVERVIEW

Diagram giving a simple overview of Diskover data management platform and indexing architecture

Integrated SEARCHES

Diskover Data Management Platform file search page screenshot.
The file search page allows you to drill down into any of your file systems (left pane in green) and their directories. You can also launch a single search to find files located in any of your file systems.
Diskover search page with Export drop-down list open to show options.
Users are offered several options to export and/or share their search results with a simple click.
Diskover Data Management Platform built-in search Filters
Users can refine their results using the built-in tools like the filters shown above. Filters can also be combined with manual queries and other built-in tools like quick search for maximum efficiency.
Diskover file attributes (metadata harvested during indexing)
Diskover harvests rich metadata, allowing for meticulous queries and precise results. Business context metadata can also be indexed via plugins. Note that the “Media Info” fields are only available with the AJA Diskover Media Edition. Additional business context metadata is available; please contact us for details.

Integrated ANALYTICS

Reports were designed to help you easily find your top unknowns. Diskover uses the rich metadata harvested during indexing to help you understand, analyze, automate, and monitor your data, giving you complete control to sustainably manage your digital assets.

This new analytical tool can be customized around what matters to your organization, from simple file type analysis to queries using business context. Any user can also apply filters for more granular investigation. This report is accessible to everyone in your organization. It is also global but can be limited to a specific path if desired.

Example Using Xytech Order Status

After creating the desired queries, select (1) the data you want to analyze, and (2) the number of top results you want in your comprehensive overview. In the example below, we are using Xytech order status metadata*. Note that any link under “by number” or “by size” can be opened to list the results by line item.

Still using the example below and following the organization’s guidelines, a decision could easily be made to free a significant amount of storage space by moving the “invoiced” data to a cost-effective asset preservation storage platform or simply deleting it after delivery to the client. Please note that this process can also be automated using Diskover’s integrated workflows.

Example of customized Reports within Diskover using order status attributes harvested from Xytech Plugin available with the AJA Diskover Media Edition, giving business context for granular results.
* The Xytech Order Status plugin is available with the AJA Diskover Media Edition.

Please note that the Reports analytic tool is included with Diskover Professional, Enterprise, Media Edition, and Life Science Edition.

By project manager using Xytech metadata

Example of customized Reports within Diskover using account manager attributes harvested from Xytech order status Plugin available with the AJA Diskover Media Edition, giving business context for granular results.

By video file types

Example of customized Reports within Diskover using video file type attributes.
Example of customized Reports within Diskover using hard links attributes.
Diskover Data Management Platform Screenshot of Dashboard report
The dashboard report offers a summary of a specific volume, including several clickable links for direct access to details.
Diskover Data Management Platform Screenshot of File Tree report
The file tree report instantly profiles your data size and aging for informed decision-making. Users can drill down directly from the chart area and open the results in the search page.
Diskover Data Management Platform Screenshot of Treemap report.
The treemap report displays hierarchical data graphically representing the size of files/directories, candidates for cleanup, aging, etc. Each color corresponds to a specific directory.

Overview | No Bytes Left on the Table

Diskover Data is leading the data management industry with its cost analysis tools and their degree of customization, therefore maximizing the ROI of your valuable assets.

Diskover offers tools to assist you in determining storage costs for your clients’ data/projects, as well as detailed reporting to substantiate your invoicing.

When calculating the cost per gigabyte, companies should also factor in additional costs such as electricity, storage providers’ fees, square footage of building space, support contracts, system administrators’ salaries, Diskover’s annual subscription cost, etc.
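As a rough illustration of that arithmetic, the sketch below folds several yearly cost components into a per-gigabyte monthly rate. It is not Diskover's cost configuration: the class, field names, and figures are hypothetical and only show one way such factors might be combined.

```python
from dataclasses import dataclass


@dataclass
class AnnualStorageCosts:
    """Hypothetical yearly cost components, all in the same currency."""
    storage_provider: float
    electricity: float
    floor_space: float
    support_contracts: float
    admin_salary_share: float
    software_subscriptions: float

    def total(self) -> float:
        # Sum every component into one yearly figure.
        return (
            self.storage_provider
            + self.electricity
            + self.floor_space
            + self.support_contracts
            + self.admin_salary_share
            + self.software_subscriptions
        )


def cost_per_gb_per_month(costs: AnnualStorageCosts, usable_capacity_gb: float) -> float:
    """Spread the yearly total over the usable capacity and 12 months."""
    if usable_capacity_gb <= 0:
        raise ValueError("usable_capacity_gb must be positive")
    return costs.total() / usable_capacity_gb / 12


def project_monthly_charge(project_size_gb: float, rate_per_gb_month: float) -> float:
    """Monthly charge for one project at the given per-GB rate."""
    return project_size_gb * rate_per_gb_month
```

A rate computed this way could then be compared against what is configured globally or per storage repository.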

This diagram gives an overview of the storage cost configuration process in the backend of Diskover, and its granularity. Storage cost analysis is a key feature of the Diskover Data curation software as it assists users in charging accordingly for their storage space.
Diskover’s storage cost settings can be configured globally or per storage repository, allowing for precision and granularity.
Diskover Data Management Platform Screenshot of Cost Analysis report.
The cost analysis tool offers an instant snapshot, as well as links to detailed information, showing where your storage money is being allocated via customized reporting. This tool is designed to 1) monitor money spent on storage, 2) invoice your customers accurately, and 3) incentivize data curation.
Diskover Data Management Platform Screenshot of User Analysis report.
The user analysis report gives a snapshot of data utilization per user and per group. It is designed to help with operating costs management, as well as storage consumption per user/group.
This image is a screenshot of the heatmap report from the Diskover Data user interface. The heatmap analytical report allows users to compare two indices from different points in time. The red represents data growth which is usually the result of onboarding a new project or files being moved. The heatmap report is an important monitoring feature for data management.
Red indicates data growth, e.g., project onboarding.
This image is a screenshot of the heatmap report from the Diskover Data user interface. The heatmap analytical report allows users to compare two indices from different points in time. The green represents data shrinkage resulting from cleanup efforts, either from data deletion or archival. The heatmap report is an important monitoring feature for data management.
Green indicates data shrinkage, e.g., data deletion.
This image is a screenshot of the indices selection page from the Diskover Data user interface. This unique feature allows users to select and compare indices from two different points in time, in turn allowing for the utilization of the heatmap report. This page is also useful to monitor when indexing and scanning took place, how long it took, and more.
The indices page gives you detailed information, searchability and ease of selection.
This image is a screenshot of the File Search page while comparing Indices from the Diskover Data user interface. When comparing two indices from different points in time, users can see growth or shrinkage results in the file search page as it displays the results in colors and percentage.
Indices comparison can also be analyzed from the file search page.
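The idea behind comparing two indices can be sketched in a few lines. The example below assumes each index has been reduced to a plain mapping of directory path to total size in bytes; that data shape and the function name are hypothetical and not Diskover's API. Positive deltas correspond to growth (red in the heatmap) and negative deltas to shrinkage (green).

```python
from typing import Dict, List, Optional, Tuple


def compare_snapshots(
    older: Dict[str, int], newer: Dict[str, int]
) -> List[Tuple[str, int, Optional[float]]]:
    """Return (path, delta_bytes, percent_change) rows, largest change first.

    percent_change is None when the path did not exist (or was empty)
    in the older snapshot, since a percentage is undefined there.
    """
    rows = []
    for path in sorted(set(older) | set(newer)):
        before = older.get(path, 0)
        after = newer.get(path, 0)
        delta = after - before
        percent = (delta / before * 100) if before else None
        rows.append((path, delta, percent))
    # Directories with the most fluctuation, growing or shrinking, come first.
    rows.sort(key=lambda row: abs(row[1]), reverse=True)
    return rows
```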

Power in History

“Having Diskover run and scan your disk is great and all, but if you wanted to merely see the size of folders you could use any disk analyzer software. Where it really starts to shine is the ability to compare disk scans against each other. Once you have two Diskover runs in Elasticsearch, Diskover will allow you to select two indices to compare. By default, each scan is timestamped to easily keep track of the history.

When comparing these, Diskover will now present you with additional features, such as heatmaps, to help you find which directories have the most fluctuation in size, either growing or shrinking.”

– linuxserver.io blog 06/28/2019
Image from a user sharing his experience with the Diskover heatmap report.

This report is accessible to everyone in your organization and is global but can be limited to a specific path if desired.

Diskover Data Management Platform Screenshot of Smart Searches Report.

Integrated WORKFLOWS

Overview

Diskover’s ultimate goal is global data curation through the various tools integrated in its data management platform. Diskover allows processes to happen without manual intervention, thereby increasing productivity and reducing human error.

Diskover offers tools to sustainably address the current explosion in data growth, so that organizations can organize and clean their data through workflows instead of simply buying more and more storage space.

Although all projects are unique, the production process remains the same. Via configurable plugins and automated scheduled tasks, Diskover makes data move through a virtual conveyor belt, allowing you to keep your data organized and reuse your existing storage space.

Far Beyond Data “Cleaning”

Diskover offers a full toolset of configurable scheduled tasks allowing for safe and controlled data curation based on your organization’s rules, which in turn allows for storage space reuse and asset preservation.

This image is a screenshot of the Diskover task panel allowing users to easily schedule automated tasks from indexing to data curation via file deletion, archival, and more, for thorough data management.
Diskover’s smart task panel allows system administrators to easily configure indexing schedules and other automated custom tasks. 

Increased Productivity Through Automation

Diagram representing that data is now treated like a physical good and goes through a virtual conveyor belt. Diskover's plugins and other features help the automation of data manufacturing, making it efficient while tracking storage space costs. Workflows are essential to sustainable data management.

Dataflow Example

Example of Diskover workflow for data movement

Tags | Manual and Automated

This image represents a screenshot of the tags analytical report available in the Diskover Data user interface. Tagging is a very important feature used to enhance workflows and assist with data management. This report gives a graphical and numerical summary of all tags applied within the global index.
Files and directories can easily be tagged manually or via automation.

Auto tags through scheduled tasks can be configured for automated data curation based on aging, project status, service contract agreement, etc.
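To make the idea of age-based auto tagging concrete, here is a minimal sketch. The rule list, tag names, and thresholds are hypothetical, and the function does not reflect Diskover's actual rule syntax or configuration file. It only shows how a scheduled task might map a file's last-modified time to a tag.

```python
from datetime import datetime, timedelta, timezone
from typing import List

# Hypothetical rules: (tag, minimum age since last modification),
# ordered from the oldest threshold to the newest so the first match wins.
AGE_RULES = [
    ("archive", timedelta(days=365 * 3)),
    ("cleanup-review", timedelta(days=365)),
]


def auto_tags(last_modified: datetime, now: datetime) -> List[str]:
    """Return the tag for the first rule whose age threshold is met."""
    age = now - last_modified
    for tag, minimum_age in AGE_RULES:
        if age >= minimum_age:
            return [tag]
    return []


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    print(auto_tags(now - timedelta(days=400), now))
```

Rules based on project status or service contract terms would follow the same pattern, with a different attribute in place of the file's age.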

Diskover also allows for manual tagging, which can be used to flag data for action, mark project workflow stages, and make data easier to search.

Screenshot representing how to tag files and/or directories manually via the file search page.
Tags can easily be applied, managed, and viewed in the search page.

Integrated ACTIONS


Due to its open-source infrastructure, Diskover allows for limitless extensibility. Our solution offers various integrated configurable plugins with many more in development. This section offers an exclusive list of plugins; please visit our SOLUTIONS page for the list of all major features.

These plugins cover a wide range of functionalities like data management, curation via workflows, data movement, data integrity, etc.


The File Action plugins can be launched in one click from the user interface. They offer diverse functionalities like the live view of pre-indexed data, seamless access to third-party platforms, and data movement just to name a few.


Diskover allows users to easily develop their own plugins to automate their custom in-house workflows. These plugins can be launched in one click using Diskover’s File Action feature.


Core Plugins

  • Listed alphabetically
  • CE = Community Edition, ESS = Essential, PRO = Professional, ENT = Enterprise, ME = AJA Diskover Media Edition, LSE = Life Science Edition
PLUGIN | TYPE | SHORT DESCRIPTION
Autoclean | Functionality | Designed to move, copy, delete, rename, or run custom commands on files and/or directories based on a set of highly configurable criteria. Any Elasticsearch query (tags, age, size, path, filename, etc.) can be used as the criteria, providing very granular actions.
Auto Tag | Functionality | Designed to auto tag an existing completed index. Auto-tagging can also be done during crawl time by adding tag rules in the config file.
Checksums | Functionality | Adds checksums (xxhash, md5, sha1, sha256) to files in Elasticsearch indices.
Checksums Hash Diff | File Action | Designed for precise data movement monitoring, it compares xxhash, md5, sha1, and sha256 hash values between the original file and the resulting file once it reaches its transfer destination, alerting on areas where checksums don’t match.
Checksums S3 | Functionality | Adds checksums (md5, sha1) using AWS Lambda/Fixity to files in Elasticsearch indices built using the S3 alt scanner.
Duplicates (Dupes) Finder | Functionality | Checks for duplicate files across a single index or all indices using xxhash, md5, sha1, and sha256 checksums.
Elasticsearch Field Copier | Functionality | Migrates Elasticsearch field data from one index to another.
Elasticsearch Query Report | Functionality | Sends email reports based on Elasticsearch search queries.
Illegal File Name | Functionality | Analyzes all directory and file names in the index for illegal characters and for overly long filenames or file paths, to proactively find files whose names can break applications. Offending filenames are tagged with the corresponding non-conformance, and the list of illegal filenames can then be sent via email reports. The plugin can be configured to remediate these issues with automatic renaming or character replacement.
Index Differential | Functionality | Designed to provide a list of file differences between two indices (or points in time). The differential list can be used to feed synchronization tools or identify deltas where two repositories should be identical. Outputs a CSV file containing the differences between the two indices.
Live View | File Action | Allows read-only access to the live directories to search or drill down into freshly onboarded data before indexing, as well as proactively copy paths.
Qumulo Data Mover | File Action | Allows authorized users to move data from Qumulo to Qumulo, Qumulo to AWS, and AWS to Qumulo.
Tag Copier | Functionality | Designed to migrate tags from one index to the next. Generally, these tags are applied post-index through manual tag application or plugin tag application (harvest, duplicate hashes, etc.).
Unix Permissions | Functionality | Adds the Unix permissions of each file and directory to the Diskover index at time of indexing. Two tags are added, unixperms-plugin and ugo+rwx, if a file or directory is found with fully open permissions (777 or 666).
Windows Attributes | Functionality | Adds Windows file owner, primary group, and DACL info to files and directories in Elasticsearch indices.
Windows Owner | Functionality | Adds the Windows file owner and primary group of each file and directory to the Diskover index at time of indexing. It replaces all docs showing username 0 with the Windows file/directory owner name.
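The fully-open-permissions check described for the Unix Permissions plugin can be illustrated with Python's standard library. This is a sketch of the check only, not the plugin's code: the function name is hypothetical, and only the two tag names and the 777/666 modes come from the description above.

```python
import stat
from typing import List

# Modes the plugin description calls "fully open".
FULLY_OPEN_MODES = {0o777, 0o666}


def open_permission_tags(st_mode: int) -> List[str]:
    """Return the two tags when the permission bits are 777 or 666."""
    # S_IMODE strips the file-type bits; the mask also drops setuid/setgid/sticky.
    permissions = stat.S_IMODE(st_mode) & 0o777
    if permissions in FULLY_OPEN_MODES:
        return ["unixperms-plugin", "ugo+rwx"]
    return []
```

In practice `st_mode` would come from `os.stat(path).st_mode` for each file or directory being indexed.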

Diskover Data is thrilled to announce the Qumulo Data Mover plugins, allowing authorized users to manually trigger data movement from Qumulo to Qumulo OR Qumulo to/from AWS.

The process is as simple as finding the data you want to move and then launching File Action in one click to start the data movement process.

Both Qumulo and Diskover offer multiple integrated tools to monitor the migration and behavior of your digital assets. For extremely precise data integrity, the Diskover Hash Differential Plugin is designed to compare checksum hash values between the original file and the resulting file once it reaches its transfer destination, from Qumulo on-prem to AWS for example, catching and alerting on any possible file corruption in the process.
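The principle of a hash differential check can be sketched with Python's `hashlib`. This is an illustration under stated assumptions, not the plugin's implementation: the function names are hypothetical, both files are assumed to be readable from local paths, and xxhash is left out because it is not part of the standard library.

```python
import hashlib
from typing import List, Sequence


def file_digest(path: str, algorithm: str = "sha256", chunk_size: int = 1024 * 1024) -> str:
    """Hash a file in chunks so large files are never loaded whole into memory."""
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def mismatched_algorithms(
    source: str,
    destination: str,
    algorithms: Sequence[str] = ("md5", "sha1", "sha256"),
) -> List[str]:
    """Return the algorithms whose digests differ between source and destination."""
    return [
        name
        for name in algorithms
        if file_digest(source, name) != file_digest(destination, name)
    ]
```

An empty result means every digest matched; any entry in the list signals possible corruption in transit and would be the trigger for an alert.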

The first 2.5 minutes of this video summarize the data movement process and the monitoring tools. The rest of the video offers an overview of each platform and the supercharged infrastructure resulting from combining both Diskover and Qumulo.

Exclusive Plugins for Media Edition

Listed alphabetically. Please visit the AJA Diskover Media Edition page for all details.

PLUGIN | TYPE | SHORT DESCRIPTION
CineViewer Player | File Action | Seamless access to a third-party platform, allowing users to launch the CineViewer Player to view/validate media files and giving visibility to end users without access to the source asset.
File Sequence | File Action | Designed to list any file sequences in a directory or from a single file in a sequence.
IMF Package Validator | File Action | IMF packages can be scanned and validated before delivery, from any location, regardless of where the IMF package is stored.
Media Info Harvest | Functionality | Adds business context and searchability via additional media file attributes (resolution, codec, etc.). The enriched metadata is key for granular analysis, workflow automation, and data curation.
Telestream GLIM | File Action | Seamless access to a third-party platform, allowing users to launch GLIM to view/validate media files and giving visibility to end users without access to the source asset.
Telestream Vantage | File Action | Seamless access to a third-party platform, allowing users to submit files to Vantage for transcoding directly from the Diskover user interface.
Xytech Asset Creation | Functionality | Adds business context and searchability following asset rehydration via asset ID attributes. The enriched metadata is key for granular analysis, workflow automation, and data curation.
Xytech Order Status | Functionality | Adds business context and searchability using order status enriched attributes (order phase, invoice date, etc.). The enriched metadata is key for granular analysis, workflow automation, and data curation.

⭐️ Customer Testimony – Vantage Workflow Integration at Visual Data Media Services

“Diskover has the ability to work with other applications if they have an SDK available. In the case of Telestream’s Vantage transcoding workflow platform, we have found that a combination of Diskover search tools and Vantage makes a really good team.

To eliminate the need to give users access to production content, and the ability to search through thousands of files, the Diskover team built a Vantage submit tool that directly allows to search a file, select that specific file or multiple files, and submit them to the Vantage workflow of your choice.

It’s extremely versatile and communicates very quickly. This procedure is only in its first phase so expanding it to allow workflow status submission, and more advanced workflows like audio overlay and subtitle overlays is on the road map.”

Randall Derchan
IT Manager, Visual Data Media Services

Image representing how to select File Action then Submit to Vantage for workflow submission - Visual Data Media Services, a customer of Diskover Data, developed and deployed their own Vantage plugin in order to automate that workflow.
Image representing how to select a workflow within Vantage following launching a customer deployed plugin - Visual Data Media Services, a customer of Diskover Data, developed and deployed their own Vantage plugin in order to automate that workflow.
Customer deployed plugins can easily be added to the File Action features and launched in one click.

Once “submit to Vantage” is selected from File Action, the Vantage software opens, ready for a workflow to be submitted.

Listed alphabetically. Please visit the Life Science Edition page for more details.

PLUGIN | TYPE | SHORT DESCRIPTION
BAM Plugin | Functionality | Allows for the curation of genome sequence file transformation.
Grant Plugin | File Action | Assists research institutes in managing their grants/members/storage costs internally, and fulfills the requirements for the new NIH DMS Policy.

Unique Indexing Architecture

Diskover File Action Live View giving users access to read-only data pre-indexing allowing for proactive search and copy paths.

Diskover customers using optimized cached scanning report that, on average, scans take between 50% and 75% less time to build subsequent indexes. Note that results may vary depending on your infrastructure and storage type. This feature is available in the Diskover annual subscription editions and is not included in the Community Edition.

Speed and Scalability

🚫 Not a Storage System = Non-Proprietary

Diskover is non-proprietary. We do not store your files; we simply index their metadata. Users therefore have access to a read-only index of the files and not to the files themselves, which gives your valuable digital assets the protection they need.
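The distinction between indexing metadata and storing files can be shown with a small sketch. It is not Diskover's crawler: the function name and document fields are hypothetical, and the example only reads `stat` information, never file contents.

```python
import os
from typing import Dict, List


def index_metadata(root: str) -> List[Dict[str, object]]:
    """Walk a directory tree and collect one metadata document per file.

    Only os.stat() is called; no file is opened, copied, or modified.
    """
    documents = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                info = os.stat(path, follow_symlinks=False)
            except OSError:
                continue  # File vanished or is unreadable; skip it.
            documents.append(
                {
                    "path": path,
                    "name": name,
                    "extension": os.path.splitext(name)[1].lstrip(".").lower(),
                    "size": info.st_size,
                    "mtime": info.st_mtime,
                }
            )
    return documents
```

Documents like these are what a search backend would hold, which is why users of the index can find and analyze files without being given access to the files themselves.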

Elasticsearch open-source software is used in the backend of Diskover for search, speed and scalability capabilities, allowing for massive amounts of data to be searched instantaneously. Elasticsearch also plays a crucial role in indexing all storage volumes in parallel.
This image is the GitHub logo. Diskover has a huge presence on GitHub and offers its free open-source Community software on that platform.

Become a Stargazer on GitHub


Sustainable Solution BENEFITS

The Cost of Unmanaged Data

Data Management Has Become Inescapable

Increases Productivity via Stakeholders Empowerment

This diagram gives an overview of how Diskover empowers all levels of an organization by giving examples of the needs, challenges and solutions for different people with different roles. Diskover allows all stakeholders to have their own relationship with data through different tools and features, facilitated by the global index and access to all data in one single view.
Diskover allows all stakeholders to have their own relationship with data.

Get in Touch to Schedule a Demo or a 30-Day Free Trial
