Big Data, Cloud & DevOps

Ten tips for Avoiding an Alternative Data Hangover

Investors in the financial industry are now having to confront the challenge of managing a large volume of data in this unstructured format, assembling in-house data scientists, engineers and IT staff who can transform it into insights. This is an extremely lengthy and expensive process. The majority of buy-sides do not have access to these types of resources, and that’s why big data vendors are essential. For hedge funds, asset managers and banks looking for a big data vendor,we have narrowed down the top 10 key areas to consider when deciding on an alternative data vendor.

How Data Exhaust can be Leveraged to Benefit your Company

Most big data programs are focused on certain types of data. These essential data types are considered most relevant to the organization’s overall goals. But what about all the data left over? Data exhaust can offer businesses significant value – if it’s leveraged properly. No, it doesn’t have to do with being exhausted by the amount of data your business collects although that’s a common sentiment among executives. Instead, it has to do with the amount of “leftover” data produced by an organization. When you set out to collect specific types of data, other information is collected at the same time.

3 MINUTES READ Continue Reading »

How Data Science Is Enabling Better Decision-making

By harnessing data science to its full potential, top-ranking decision makers in all industries, not only make better-informed decisions but make them with clearer predictions of the future. With that advantage on their side, they are able to stabilize businesses that have not always had a clear vision and save businesses that are on the brink of collapse. Once goals have been established, data scientists can work their magic and theorize how to fix it. Data science alone is not an advantage for decision-making, data science combined with good leadership is.

4 MINUTES READ Continue Reading »
  • Top articles, research, podcasts, webinars and more delivered to you monthly.

  • Why “True” Real-time Data Matters: The Risks of Processing Data-at-rest Rather than Data-in-motion

    What do I mean by true real-time data? It is data that has just been generated and never been stored. Because once data has been stored, no matter for how long, it is no longer real-time. Can you imagine making vital business decisions based on three-month-old insights? How about a week old? Or a day old? Minutes-old data can be irrelevant for the real-time decisions that matter most to your business, yet many people don’t understand the difference between real-time analytics with real-time data and real-time analytics with stale data.

    2 MINUTES READ Continue Reading »

    Five Amazing Improvement Big Data Can Bring to Retail

    Data is slowly replacing experience and traditions in the way companies do business. It has already proven its value in different verticals, including finance, healthcare, and of course, retail. The first obstacle is to define the scope of the Big Data project. What are the most critical questions the company needs to answer? What data sources should they analyze? Are these already available? Is the data clean and reliable? While in the initial implementation phase Big Data-related modifications can affect usual workflows and slow down business, the opposite happens after they are set in place.

    4 MINUTES READ Continue Reading »

    What does it mean to be a Senior Data Scientist?

    What do we expect from a Senior Data Scientist? Senior Data Scientists understand that Software/ Machine Learning has a lifecycle and so spend a lot of time thinking about that. They understand that ‘data’ always have flaws. These flaws can be data generating processes, biases in data. They understand the ‘soft’ side of technical decision making, focus on impact and value, and care about ethics. At the very least Senior Data Scientists should read some of the code of ethics in Data Science and have views on these. Ideally, you should have your own code of ethics, and maybe enforce those on yourself.

    4 MINUTES READ Continue Reading »

    Data Preprocessing for Non-Techies: Feature Exploration and Engineering, Part Two — Checklist of Most Common Practices

    We have covered the basic terms and definitions for data types and structure on my previous post, let’s dive into the creative and most time consuming side of data science — cleaning and feature engineering. What are some of the basic strategies that data scientists use to clean their data AND improve the amount of information they get from it?The type of cleaning and engineering strategies used usually depend on the business problem and type of target variable, since this will influence the algorithm and data preparation requirements.

    1 MINUTES READ Continue Reading »

    Data Preprocessing For Non-Techies: Basic Terms and Definitions, Part One — Data Structures, Types and Values

    Ready to learn Data Science? Browse courses like Data Science Training and Certification developed by industry thought leaders and Experfy in Harvard Innovation Lab. If you are getting started in your data science journey and don’t come from a technical background (like me), then you definitely understand the struggle of keeping up with the terminology of data

    4 MINUTES READ Continue Reading »

    Hiring Data Scientists Step 1: Stop Looking for Data Scientists.

    Ready to learn Data Science? Browse courses like Data Science Training and Certification developed by industry thought leaders and Experfy in Harvard Innovation Lab. Dear world, We are looking for someone to fill an upcoming gap in our business model. We are not exactly sure what you will be doing, but we are sure that our shareholders

    5 MINUTES READ Continue Reading »

    A Great Pitfall: Neglecting Validation

    Ready to learn Data Science? Browse courses like Data Science Training and Certification developed by industry thought leaders and Experfy in Harvard Innovation Lab.   Photo by Paul on Unsplash It was in the first chapter in your book, it was in the first lecture you attended, or it was mentioned in the first tutorial you watched. It seems simple:

    7 MINUTES READ Continue Reading »

    APM for Big Data: An Architect’s Guide

    Ready for Big Data Training & Certification? Browse courses like Big Data – What Every Manager Needs to Know developed by industry thought leaders and Experfy in Harvard Innovation Lab. Explore the area of big data Application Performance Management (APM) and why enterprises need it. APM is not a new discipline, but it is a new best

    4 MINUTES READ Continue Reading »

    Why DataOps May Be in Your Future

    Ready to become DevOps Engineer? Browse courses like AWS Certified DevOps Engineer—Professional Exam Training developed by industry thought leaders and Experfy in Harvard Innovation Lab. Recalling my college student days, I always adored professors who thoroughly walked through what syllabus topics would appear on an upcoming test.  That effort made it easier to prepare, and even

    3 MINUTES READ Continue Reading »