Ready to learn Data Science? Browse courses like Data Science Training and Certification developed by industry thought leaders and Experfy in Harvard Innovation Lab.
We wanted to follow up our previous piece about how to grow as a data scientist with some other skills senior data scientists should have. Our hope is to bridge the gap between business managers and technical data scientists by creating clear goals senior data scientists can aim for. Both entities have to take on very different problems. Both benefit when they are on the same page. This is why the previous post focused so highly on communication. It seems simple, but the gap between technical and business continues to grow as new technologies keep getting piled on every year. Thus, we find it important that managers and data scientists have a clear path of expectations.
We find that is beneficial when data scientists are starting their journey that they focus heavily on the technical aspects. This means programming, queries, data cleansing, etc. However, as data scientists grow. They need to focus more on design decisions and communication with management. This will multiply the impact of the more experienced data scientists knowledge. Instead of being stuck in the day to day of coding. They can make higher level decisions and help the younger data scientists if they get stuck. More experienced data scientists benefit both themselves and their companies more when they are utilizing their experience to help make design decisions that simplify complex systems, optimize data flows, and help make decisions on what projects are most pertinent.
Being Able To Simplify The Complex
Data scientists have a tendency to want to use every technique and algorithm they know on every problem and in every solution. In turn, this creates complex systems that are difficult to maintain.
Data science does require complex and abstract modeling as well as plethora of complex technologies (from Hadoop to Tensorflow). With all the complexity that surrounds the field, it is tempting to develop systems and algorithms that are in turn complex. There is the temptation to involve 4 or 5 different technologies and utilize every new hot algorithm or framework. However, like most other fields that have some engineering involved. Reducing complexity is often better for multiple reasons.
The role of an engineer is to simplify a task. If you have ever built or seen a Rube Goldberg machine you will understand the idea of over engineering a simple task. Some data scientists algorithms and data systems would look more like some crazy mouse trap held together by duct tape and gum instead of an elegant but effective solution. Making simpler systems means the systems will be easier to maintain over time as well as provide future data scientists the ability to add and take away modules as needed. The next data scientist taking your position will thank you if you create a simple framework. On the other hand, if you use 3 different languages, 2 data sources, 10 algorithms and leave no documentation, then just know the future engineer is cursing your name under his breath.
Simple algorithms and systems also allow for easier additions and subtractions to be made. Thus, as technology changes and updates are required or a module needs to be taken out. A poor future data scientist isn’t stuck with playing a game of Jenga with your code. If I remove this block of code, will everything fall apart(have you heard of technical debt?)
Knowing How To Mesh Data Without Primary Keys
One of the big values strong data experts should provide is tying together data sets that might not inherently have a primary or obvious connection. Data can represent a person or business’s day to day interactions. Having the ability to find statistical patterns in this data is what allows data scientists the ability to help decision makers make wise choices. However, the data you would like to mesh together is not always on the same system or the same granularity.
Those who have worked with data will know it is not always integrated together nicely in one database. Finance data is often kept separate from IT Service Management data, and outside data sources might not have the same level of aggregation. This is a problem because to find value in data sometimes requires data from other departments and systems.
For example, what if you are provided medical claims, credit card and criminal rates of neighborhoods and want to figure out how these socio-economic factors affect the patient?. Some sets of data might be on a person by person level while the others might be on a street or city level with no clear method to connect the the data sets. What is the best way to proceed? This becomes a design issue that one, must be recorded and two must be thought out.
Each situation is different as there are many ways to mesh data. It could be based of region, traits, spending habits, etc. This is why experience is important. An experienced data scientist will have the intuition on how the data can be joined. Mostly because they have already tried hundreds of methods that don’t work. Often times, the closer you can combine both data sets to person by person the better. So if region or city happens to be the lowest level (Lowest level refers to granularity of the data, like person level, household level, street level, city level, state level, or many other groupings ) of connection, then that would be a great place to start.
Being Able To Prioritize Projects
As a data scientist, you have to know how to explain the ROI of projects that might not turn out. This is just about good direct communication(Our team will never stop talking about communication). This is about being able to articulate value as well as prioritize long term vs short term goals(again, easier said than done).
Teams will always have more projects and project requests than they can handle. More experienced team members need to take the lead and help their managers decide which projects are actually worth taking on. There is a fine balance between quick projects that might not have the highest ROI but have a good chance of succeeding and long term projects that are more likely to fail but also provide a large ROI.
In this case it is good to have a decision matrix of sorts to help simplify the process.
One of the classical decision matrices for projects is a 2 by 2 matrix that is importance and urgency. This matrix can be found in most business courses at college and it is really simple. That is why it is great!
I have worked at companies with really smart people. Yet, every project was treated as a priority and if you haven’t heard the saying, we will say it here.
If everything is a priority then nothing is.
Many other companies have this problem. This is why it is important for the experienced members of the data science teams to be to clearly articulate which projects really should be done now, vs later. Thus, using the simple matrix will do that.
(Like we said in our last post, being concise is important. Using the matrix to help specify ROI will help).
When there is concise and straightforward communication, projects continue to move forward and trust is built.
Being Able To Develop Robust And Optimal Systems
Making an algorithm or model that operates in a controlled environment is one thing. Integrating a robust model into a system that is live and deals with massive amounts of data is a whole other thing. Depending on the company, sometimes the data scientist will just have to develop the algorithm itself. Then either a developer or machine learning engineer will be responsible for putting it into production.
However, this is not always the case. Smaller companies, and smaller teams might have the data science team put the code into production. This means the algorithm needs to be able to manage the data traffic at a reasonable speed. If your algorithm takes 3 hours to run and needs to be accessed live. It is not going into production. Thus, good system design and optimization is necessary.
Data science is complex field that requires an understanding of data, statistics, programming, and subject matters. In order to grow, data scientists need to be able to simplify and distill these complexities into algorithms. They need to be able to focus more on making design decisions. This helps maximize their knowledge and experience that they have.
Summary
Senior data experts provide the largest impact for both themselves and their companies when they go beyond their technical abilities. The value they bring to the table is their experience, it can help guide younger developers to make better design decisions, and help managers make better decisions on which projects will have the best ROI. In turn, this magnifies the impact of their involvement on the team.
Call To Action
Are you an executive or director that needs help improving your communication between your data science team and your business owners? We want to help! Our team specializes in seminars to help improve communication and output of your data driven teams. Reach out to us today!
Interested In Reading More About Being A Better Data Scientist?
How To Grow As A Data Scientist
Boosting Bagging And Building Better Algorithms
How To Survive Corporate Politics As A Data Scientist
8 Top Python Libraries For Machine Learning
There’s certainly a lot to find out about this topic. I like
all of the points you made.
Thank you Emilio! Glad you find the articles useful.
Please subscribe to our blog for more updates.
If some one wants to be updated with hottest technologies then he
must be pay a quick visit this web site and be up to date daily.
Thank you. Please subscribe to receive more updates.
Hiya! I know this is kinda off topic however I’d figured
I’d ask. Would you be interested in trading links or maybe guest authoring a blog article or vice-versa?
My blog discusses a lot of the same topics as
yours and I think we could greatly benefit from each other.
If you are interested feel free to shoot me an e-mail.
I look forward to hearing from you! Wonderful
blog by the way!
What’s up, just wanted to tell you, I enjoyed this article.
It was helpful. Keep on posting!
Thank you. Please subscribe to receive more updates.
I’ve read a few excellent stuff here. Definitely price bookmarking for revisiting.
I surprise how so much effort you set to make this sort of wonderful informative site.
You could certainly see your expertise in the article you write.
The sector hopes for more passionate writers such as you who
are not afraid to say how they believe. At all times
go after your heart.
I pay a quick visit day-to-day some web sites and information sites to read articles or reviews, except this weblog gives feature based articles.
This site was… how do you say it? Relevant!! Finally I’ve found something that helped me.
Thanks a lot!
Pretty great post. I just stumbled upon your blog and wanted to mention that I
have really loved browsing your weblog posts. In any case I’ll be subscribing on your feed and I
hope you write again soon!
A fascinating discussion is worth comment.
I believe that you ought to publish more on this subject matter, it may not be a taboo matter but usually people don’t speak about
such subjects. To the next! All the best!!
Thank you for sharing your info. I truly appreciate your efforts and I am waiting for your further post thanks once again.
Thank you. Please subscribe here
Hi,
Sorry about the experience. We are surely working on improving the functionality.
Sorry about the experience. We are surely working on improving the functionality.
Sorry about the experience. We are surely working on improving the functionality.
Sorry about the experience. We are surely working on improving the functionality.
Sorry about the experience. We are surely working on improving the functionality.
Thank you. Please subscribe to receive more updates.
Thank you. Please subscribe to receive more updates.
Thank you. Please subscribe to receive more updates.
Sorry about the experience. We are surely working on improving the functionality.
Thank you. Please subscribe to receive more updates.
I’m more than happy to discover this great site.
I wanted to thank you for your time for this fantastic read!!
I definitely liked every bit of it and I have you book-marked to look
at new stuff in your blog.
Thanks very interesting blog!
Keep on working, great job!
There’s certainly a great deal to know about this issue.
I love all the points you have made.
I do trust all the concepts you have introduced for your post.
They’re really convincing and can certainly work.
Still, the posts are very quick for starters. Could you please extend
them a bit from next time? Thanks for the post.
Thanks Joe. Please subscribe to Experfy blog.
I always spent my half an hour to read this web site’s posts everyday along with
a mug of coffee.
Thanks Ludie.
Greate post. Keep writing such kind of info on your blog.
Im really impressed by it.
Hey there, You’ve done a great job. I’ll definitely digg it and in my opinion recommend to my
friends. I’m sure they will be benefited from this website.
Good post. I definitely love this website. Stick with it!
Thanks Patricia. Please subscribe to Experfy blog.
Appreciate the recommendation. Let me try it out.
Awesome! Its truly awesome post, I have got much clear idea concerning
from this article.
Good way of describing, and nice paragraph to get information about my presentation focus, which i am going
to convey in university.
There’s definately a great deal to learn about this subject.
I like all the points you’ve made.
If some one needs expert view concerning blogging and site-building afterward i suggest him/her to visit this
blog, Keep up the nice work.
Fine way of telling, and nice piece of writing to take data regarding my presentation focus,
which i am going to convey in academy.
Hey There. I found your blog using msn. This is a very well written article.
I’ll make sure to bookmark it and return to read more of your useful
information. Thanks for the post. I’ll certainly comeback.
Hello there! Do you use Twitter? I’d like to
follow you if that would be ok. I’m undoubtedly enjoying your blog and look
forward to new updates.
Thank you Epifania. Our Twitter handle is https://twitter.com/experfy
Please subscribe to Experfy blog
Why users still make use of to read news papers when in this
technological world everything is accessible
on web?
Hello to every single one, it’s really a good for me to visit this website, it consists of
useful Information.
Thank you Phillis. Please subscribe to Experfy blog
What’s up, after reading this amazing article i am as well delighted to share my
knowledge here with colleagues.
Thank you Magda. Please subscribe to Experfy blog
Good way of describing, and nice post to obtain information concerning my presentation subject, which i am going to present in university.
Thank you Tami. Please subscribe to Experfy blog
Hello there! I could have sworn I’ve been to this web site before but after going through some of the articles I realized it’s new to me.
Regardless, I’m certainly happy I found it and I’ll be bookmarking it and
checking back regularly!
Thanks Lonnie!
My family members always say that I am killing my time here at
net, however I know I am getting know-how all the
time by reading thes good articles.
Thanks Clifford. Please subscribe to Experfy blog
Very nice post. I just stumbled upon your weblog
and wanted to say that I have truly enjoyed
surfing around your blog posts. After all I’ll be subscribing
to your rss feed and I hope you write again very soon!
Your method of describing all in this piece of writing is
genuinely nice, every one be capable of without difficulty know it, Thanks a
lot.
Thank you for some other great article. The place else may just anyone get
that type of info in such an ideal method of writing? I’ve
a presentation next week, and I’m at the search for
such info.
Thanks Lakesha! Please subscribe to Experfy blog
Thank you for the feedback. Our backend team is working to fix the issue.
Thank you Giuseppe. Please subscribe to Experfy blog
Woah! I’m really loving the template/theme of this blog. It’s simple, yet effective.
A lot of times it’s tough to get that “perfect balance” between superb usability and visual appearance.
I must say that you’ve done a great job with this. Additionally, the blog loads very fast for me on Chrome.
Superb Blog!
Hello! I just wish to offer you a big thumbs up for your great info you’ve got here
on this post. I am coming back to your web site for more soon.
I’m very pleased to discover this website. I want to to thank you for ones time just for this
wonderful read!! I definitely enjoyed every bit of it and I have you bookmarked to see new stuff in your blog.