Learn how Dataedo helped Vanderbilt University conquer challenges related to standardizing business terms and technical definitions, centralizing their data inventory, and providing essential data lineage insights to enhance their data quality and data literacy.
Vanderbilt University, situated in Nashville, Tennessee, stands as a prominent private research institution. Approximately a decade ago, the university's institutional research group took on the significant task of handling institutional-level reporting. This endeavor led to the establishment of a dedicated data team, which provided support to various academic departments in fulfilling their data-related requirements. However, in the course of their operations, they confronted challenges stemming from lack of clarity in business term definitions and data calculations.
To address these challenges, the university adopted an external tool to establish uniformity and documentation in both business and technical term definitions. Despite the tool's evolution and incorporation of new features, its adoption remained constrained, largely serving a solitary business team's needs, while overlooking the comprehensive data governance demands of the entire organization. Ultimately this led to the termination of the tool's license. Subsequently, the team transferred all data definitions to a spreadsheet, exacerbating the complexities of maintenance.
In recent years, Vanderbilt University has implemented a data governance program, and the data team has undergone organizational restructuring. Newly appointed to the team, the Chief Data Officer (CDO) discerned that certain teams weren't harnessing available data effectively. This observation underscored the necessity for a more centralized approach to data governance. As a strategic response, the decision was made to initiate an internal data governance review of prevailing data practices, processes, and strategies. The goal was clear: identify opportunities for enhancement that would pave the way for a more streamlined and effective data ecosystem.
The internal data governance review brought to light several areas for potential enhancement:
There was pressing need for the team to gain a clear understanding of the range and location of data collected across the university as well as its classification.
To address the gaps identified in the internal review, the decision was made to introduce a data catalog system. This system would act as a central hub, housing comprehensive data documentation and functioning as a gateway to navigate and manage the data landscape effectively.
In their pursuit of an apt data catalog solution, the team embarked on an extensive market analysis, surveying a range of available products and identifying the major contenders in the field. What they encountered were predominantly high-cost systems demanding substantial resources and investments, far surpassing their allocated budget for the year. Fortunately, amidst these major players, Dataedo emerged as a viable option.
Other systems were quite expensive. The capital investment upfront to implement those systems felt like we would need a team of 20 people and a significant commitment, requiring a substantial expenditure over multiple years.
To evaluate how Dataedo would function within their intricate operational landscape, the team opted for a 3-month Proof of Concept*. This endeavor proved to be successful, playing a pivotal role in showcasing the value and compatibility of Dataedo with their requirements.
To rationalize the allocation of resources for Dataedo, the team undertook a thorough cost analysis by juxtaposing it against the expense of other prominent solutions dominating the market. This comparative evaluation revealed substantially higher costs associated with the alternatives. The combination of reasonable pricing, coupled with a seamless and swift implementation process, made Dataedo an easily justifiable choice.
Proof of Concept allowed us to go in and see what it does and how it works in our environment.
*Dataedo offers a 3-month Proof of Concept license for $999 with all features available that allows you to test it out thoroughly in your own environment.
The team's embrace of Dataedo took precedence as they found themselves thrust into uncharted territory—a fresh project involving a system brimming with data concerning alumni and donors, an arena entirely novel to the team. Adding to the complexity, this dataset posed certain quality concerns.
The primary objective was to swiftly comprehend the nature of this data and its potential utility. Recognizing the potential, the team identified avenues where Dataedo could lend its assistance. Leveraging the data profiling feature, they discerned segments of quality data and areas where improvements were needed.
The documentation process was methodically divided between the data governance program and data stewards. The data governance team undertook the responsibility of tool administration and configuration, establishing a structured framework that guided the data stewards in documenting the essential elements. This approach ensured consistency throughout the entire spectrum of documentation, catering to all stakeholders uniformly. Furthermore, the data governance team extended crucial support and conducted requisite training sessions.
Reins are being handed over to the data stewards, capitalizing on their specialized knowledge to intricately document the data aspects. This collaboration between data governance and data stewards ensures a comprehensive and coherent documentation process.
The team recognized the substantial value offered by the data lineage feature, particularly in critical scenarios such as incident response. Data lineage within Dataedo facilitated not only the documentation of terms but also provided a clear trajectory of data origins. This enabled them to trace the data's path back to its source systems and comprehend the transformations it underwent during its journey.
Data Lineage not only tells us where the data came from within the database, data warehouse etc., but also allows us to track this lineage back to the source system. This is crucial from internal audit perspective.
Moreover, the team recognized Dataedo as an ideal solution for tackling data quality concerns and implementing industry best practices, with data lineage emerging as a pivotal component within this context.
Furthermore, Dataedo played a pivotal role in enhancing data literacy within the organization. It became a powerful tool for educating data consumers about the nuances of data terms and enlightening them about the sources of these terms. In this regard, the data lineage feature once again demonstrated its tremendous value.
Vanderbilt University's journey to implement Dataedo as part of its data governance strategy showcases the challenges and opportunities that educational institutions face when managing data in a diverse and complex environment. By focusing on one data set first and collaborating with data experts, the institution was able to lay a strong foundation for accurate data documentation, classification, and improved data governance practices. The ongoing efforts to educate stakeholders and promote a culture of data stewardship remain critical to their success in the evolving landscape of data management.
For other educational institutions seeking a data catalog solution, Vanderbilt University recommends the following approach:
You need to figure out what it is that you are trying to accomplish before you start looking at toolsets that help you do that.