No Data Engineers, No Problem: How a Small Data Science Team Can Standardize and Centralize Data with a Data Library

Chris Umphlett IN Experfy Insights, Blog series

Introduction

Access to and control of data is one of the biggest challenges faced by data analysts and data scientists.  Creative, persistent analysts find ways to get access to at least some of this data but doing that efficiently in a way that is also standardized and centralized for everyone on the team is difficult. These teams may not have the budget, skills, or IT support needed to successfully implement a data management application. In this series, I will explain a principled approach to home-grown data management that has a low technical barrier to entry and is platform-agnostic: the Data Library.

Big Data, Cloud & DevOps
Introduction To Data Libraries For Small Data Science Teams
At smaller companies access to and control of data is one of the biggest challenges faced by data analysts and data scientists. The same is true at larger companies when an analytics team is forced to navigate bureaucracy, cybersecurity and over-taxed IT, rather than benefit from a team of data engineers dedicated to collecting and
3 MINUTES READ Continue Reading

Big Data, Cloud & DevOps
A Tech-Agnostic, Principled-Approach To Grassroots Data Management
In the introduction to this series, I explained what a data library is and how it can help a small data analytics team that lacks formal business intelligence support create a solid foundation for data management. This article will explain the universal principles that should guide the development of a data library. Let’s Look At
5 MINUTES READ Continue Reading

Big Data, Cloud & DevOps
The “Operationalized” Data Library- Using Your Data Library To Create Value Quickly And Efficiently
In previous articles in this series on the usage of a data library I dove into the first two of the four characteristics of a data library. This article will explain how the last two characteristics come together in the “operationalization” of your data library. What is a data library? * A set of principles
6 MINUTES READ Continue Reading

Big Data, Cloud & DevOps
Examples Of How To Implement Each Principle Of A Data Library
In the previous article I explained the technology-agnostic principles behind a good data library. This article gives specific examples of how these principles may be implemented. Let’s dive in to the examples of how to implement Data Library principles Automation There are several components to successful automation. The most obvious one is the ability to
6 MINUTES READ Continue Reading

Big Data, Cloud & DevOps
Organizing A Data Library
So far in this series I have explained the concept of a data library and the principles behind it. Now I will explain how it interacts with the various building and water metaphors for data storage. There is no shortage of data metaphors to draw from for your data library Metaphors explaining how data should
5 MINUTES READ Continue Reading

Big Data, Cloud & DevOps
Prioritizing Data Sources For Your Data Library
In the previous article, I prescribe prioritizing data sources inclusion in a data library according to business value, difficulty, and privacy concerns. This can be done utilizing a scoring rubric and interviewing the owners and/or key stakeholders of each data source. While these things may not be measurable they can be quantified in a relative
5 MINUTES READ Continue Reading

Big Data, Cloud & DevOps
Creating A Repeatable Data Library Process
In this final article in a series on how small analytics teams can build a self-managed data library for effective data management, I will summarize the previous articles and show how to put it all together into a repeatable process. A Data Library is Built on a Set of Principles for Data Management, not a
3 MINUTES READ Continue Reading

  • Top articles, research, podcasts, webinars and more delivered to you monthly.

  • Leave a Comment
    Next Post

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    Blog series
    AI in Five, Fifty, and Five Hundred Years

    Introduction Prediction is a tricky business. You have to step outside of your comfort zone, your fainted vision of the world and see it thorough across all possible dimensions. In this series, we will discuss the future of “AI”, applications that are yet unexplored.

    1 MINUTES READ Continue Reading
    Blog series
    Ethics of Emerging Technologies

    Introduction: Humans are wired to make tough decisions bringing all the context and principles to bear. Similarly, can devices apply the available information to make the right judgment calls? In this series, we shall discuss some ethical dilemmas faced by emerging technologies.

    1 MINUTES READ Continue Reading
    Blog series
    How to Become a Data Scientist

    Introduction: Certain skill sets suit certain positions better than others, and this is why the path to data science is not uniform and can be via a diverse range of fields such as statistics, computer science and other scientific disciplines. This series aims to present 3 aspects of ‘How to become a Data Scientist’ starting

    1 MINUTES READ Continue Reading