Data Science Digest

Ankit Rathi Ankit Rathi
May 3, 2018 Big Data, Cloud & DevOps

Ready to learn Data Science? Browse courses like Data Science Training and Certification developed by industry thought leaders and Experfy in Harvard Innovation Lab.

Data Science Digest

Data Science is an amalgamation of many other fields like mathematics, technology & domain; it has its own concepts, process & tools. It’s really tough to know each and everything related to the subject unless you have really worked on complex data science problems in the industry for a couple of years.

In this post, I have tried to aggregate & organize all the data science related topics from Quora (generic definitions), Medium (in-depth working) & GitHub (code). This post is organized in these sections of data science area:

  1. Introduction
  2. Prerequisites
  3. Concepts
  4. Algorithms
  5. Process
  6. Tools

Data Science Introduction

In this section, you can get introduced to data science world. What is data science? Why it is important? What is the difference between Artificial Intelligence, Data Science, Machine Learning & Deep Learning?

  • What is Data Science?
  • Why Data Science is important?
  • Artificial Intelligence Vs Data Science Vs Machine Learning Vs Deep Learning

Data Science Prerequisites

Before diving deep into data science, one needs to cover a lot of ground like decent understanding of linear algebra, statistics, probability & data engineering.

  • Linear Algebra
  • Statistics
  • Probability Theory
  • Data Engineering

Data Science Concepts

In this section, you can learn the data science concepts like types of learning and when to use which kind of learning algorithms?

  • Supervised Learning (Regression, Classification)
  • Unsupervised Learning (Clustering, Anomaly Detection)
  • Reinforcement Learning
  • Deep Learning (Artificial Neural Networks)

Data Science Algorithms

This section covers various (mostly used) data science algorithms in detail. Which kind of problems these algorithms solve & what are the pros & cons of using these algorithms?

  • Classification (k-Nearest Neighbors, Logistic Regression, Decision Trees, Naive Bayes)
  • Regression (Linear, Polynomial, Ridge, Lasso, ElasticNet)
  • Support Vector Machines
  • Neural Nets
  • Random Forests
  • Clustering (K-Means, Mean-Shift, DBSCAN, EM-GMM, Agglomerative Hierarchical)
  • Deep Learning (CNNs, RNNs, LSTMs)

Data Science Process

In this section, you will get to know data science as a process; once you have a problem, what approach will you take? How will you collect & clean data? Which evaluation and tuning technique will you use to optimize your data science algorithm.

  • Data Science Process (Data Collection, Data Cleaning, Modeling, Model Evaluation, Model Tuning, Prediction)
  • Exploratory Data Analysis
  • Feature Engineering
  • Ensembling (Bagging, Boosting & Stacking)

Data Science Tools

This section covers the tools being used in data science field like R, Python, SQL or machine learning platforms provided by Azure & Amazon.

  • R
  • Python (TensorFlow, Keras)
  • SQL
  • Azure Machine Learning
  • Amazon Machine Learning
  • Experfy Insights

    Top articles, research, podcasts, webinars and more delivered to you monthly.

  • Ankit Rathi

    Tags
    Data Science
    © 2021, Experfy Inc. All rights reserved.
    Leave a Comment
    Next Post
    Two-Speed IT is Obsolete: Moving towards Full-Speed Agile and DevOps

    Two-Speed IT is Obsolete: Moving towards Full-Speed Agile and DevOps

    Leave a Reply Cancel reply

    Your email address will not be published. Required fields are marked *

    More in Big Data, Cloud & DevOps
    Big Data, Cloud & DevOps
    Cognitive Load Of Being On Call: 6 Tips To Address It

    If you’ve ever been on call, you’ve probably experienced the pain of being woken up at 4 a.m., unactionable alerts, alerts going to the wrong team, and other unfortunate events. But, there’s an aspect of being on call that is less talked about, but even more ubiquitous – the cognitive load. “Cognitive load” has perhaps

    5 MINUTES READ Continue Reading »
    Big Data, Cloud & DevOps
    How To Refine 360 Customer View With Next Generation Data Matching

    Knowing your customer in the digital age Want to know more about your customers? About their demographics, personal choices, and preferable buying journey? Who do you think is the best source for such insights? You’re right. The customer. But, in a fast-paced world, it is almost impossible to extract all relevant information about a customer

    4 MINUTES READ Continue Reading »
    Big Data, Cloud & DevOps
    3 Ways Businesses Can Use Cloud Computing To The Fullest

    Cloud computing is the anytime, anywhere delivery of IT services like compute, storage, networking, and application software over the internet to end-users. The underlying physical resources, as well as processes, are masked to the end-user, who accesses only the files and apps they want. Companies (usually) pay for only the cloud computing services they use,

    7 MINUTES READ Continue Reading »

    About Us

    Incubated in Harvard Innovation Lab, Experfy specializes in pipelining and deploying the world's best AI and engineering talent at breakneck speed, with exceptional focus on quality and compliance. Enterprises and governments also leverage our award-winning SaaS platform to build their own customized future of work solutions such as talent clouds.

    Join Us At

    Contact Us

    1700 West Park Drive, Suite 190
    Westborough, MA 01581

    Email: support@experfy.com

    Toll Free: (844) EXPERFY or
    (844) 397-3739

    © 2025, Experfy Inc. All rights reserved.