Blog series

No Data Engineers, No Problem: How a Small Data Science Team Can Standardize and Centralize Data with a Data Library

Chris Umphlett IN Experfy Insights, Blog series

September 8, 2023 | Experfy Insights, Blog series

Introduction

Access to and control of data is one of the biggest challenges faced by data analysts and data scientists. Creative, persistent analysts find ways to get access to at least some of this data but doing that efficiently in a way that is also standardized and centralized for everyone on the team is difficult. These teams may not have the budget, skills, or IT support needed to successfully implement a data management application. In this series, I will explain a principled approach to home-grown data management that has a low technical barrier to entry and is platform-agnostic: the Data Library.

Introduction To Data Libraries For Small Data Science Teams

At smaller companies access to and control of data is one of the biggest challenges faced by data analysts and data scientists. The same is true at larger companies when an analytics team is forced to navigate bureaucracy, cybersecurity and over-taxed IT, rather than benefit from a team of data engineers dedicated to collecting and

3 MINUTES READ Continue Reading

Big Data, Cloud & DevOps

A Tech-Agnostic, Principled-Approach To Grassroots Data Management

In the introduction to this series, I explained what a data library is and how it can help a small data analytics team that lacks formal business intelligence support create a solid foundation for data management. This article will explain the universal principles that should guide the development of a data library. Let’s Look At

5 MINUTES READ Continue Reading

Big Data, Cloud & DevOps

The “Operationalized” Data Library- Using Your Data Library To Create Value Quickly And Efficiently

In previous articles in this series on the usage of a data library I dove into the first two of the four characteristics of a data library. This article will explain how the last two characteristics come together in the “operationalization” of your data library. What is a data library? * A set of principles

6 MINUTES READ Continue Reading

Big Data, Cloud & DevOps

Examples Of How To Implement Each Principle Of A Data Library

In the previous article I explained the technology-agnostic principles behind a good data library. This article gives specific examples of how these principles may be implemented. Let’s dive in to the examples of how to implement Data Library principles Automation There are several components to successful automation. The most obvious one is the ability to

6 MINUTES READ Continue Reading

Big Data, Cloud & DevOps

Organizing A Data Library

So far in this series I have explained the concept of a data library and the principles behind it. Now I will explain how it interacts with the various building and water metaphors for data storage. There is no shortage of data metaphors to draw from for your data library Metaphors explaining how data should

5 MINUTES READ Continue Reading

Big Data, Cloud & DevOps

Prioritizing Data Sources For Your Data Library

In the previous article, I prescribe prioritizing data sources inclusion in a data library according to business value, difficulty, and privacy concerns. This can be done utilizing a scoring rubric and interviewing the owners and/or key stakeholders of each data source. While these things may not be measurable they can be quantified in a relative

5 MINUTES READ Continue Reading

Big Data, Cloud & DevOps

Creating A Repeatable Data Library Process

In this final article in a series on how small analytics teams can build a self-managed data library for effective data management, I will summarize the previous articles and show how to put it all together into a repeatable process. A Data Library is Built on a Set of Principles for Data Management, not a

3 MINUTES READ Continue Reading

Managing The Big Data Project – Lifecycle, Approach, Team Composition, Pitfalls

Leave a Reply Cancel reply

Blog series

AI in Five, Fifty, and Five Hundred Years

Introduction Prediction is a tricky business. You have to step outside of your comfort zone, your fainted vision of the world and see it thorough across all possible dimensions. In this series, we will discuss the future of “AI”, applications that are yet unexplored.

1 MINUTES READ Continue Reading

Blog series

Ethics of Emerging Technologies

Introduction: Humans are wired to make tough decisions bringing all the context and principles to bear. Similarly, can devices apply the available information to make the right judgment calls? In this series, we shall discuss some ethical dilemmas faced by emerging technologies.

1 MINUTES READ Continue Reading

Blog series

How to Become a Data Scientist

Introduction: Certain skill sets suit certain positions better than others, and this is why the path to data science is not uniform and can be via a diverse range of fields such as statistics, computer science and other scientific disciplines. This series aims to present 3 aspects of ‘How to become a Data Scientist’ starting

1 MINUTES READ Continue Reading

About Us

Incubated in Harvard Innovation Lab, Experfy specializes in pipelining and deploying the world's best AI and engineering talent at breakneck speed, with exceptional focus on quality and compliance. Enterprises and governments also leverage our award-winning SaaS platform to build their own customized future of work solutions such as talent clouds.

Join Us At

1700 West Park Drive, Suite 190
Westborough, MA 01581

Email: support@experfy.com

Toll Free: (844) EXPERFY or
(844) 397-3739