
Data Science Skills

Below are some of the data science skills that every data scientist must know:

1. Change is the only constant

It’s not about “learning Data Science”; it’s about “improving your Data Science skills”!

The subjects you are learning in grad school are important, because no learning goes to waste, but real-world practice is very different from the textbook theory that has been taught for decades. Don’t cram information; understand the big picture instead.

A report states that 50% of what you learn today about IT will be outdated in 4 years. Technology can become obsolete, but learning cannot. Cultivate the attitude of learning, keep your knowledge up to date, and focus on your skills (get your basics clear) rather than on the information you memorize!

This will help you survive in this tough and competitive world (I am not scaring you, I am just asking you to prepare your best!). To become a data scientist, start focusing on the skills below:

Business Skills
Practical Skills such as math and statistics
Coding Skills (as I said, learn and don’t cram)
Soft Skills such as people skills, social skills, data visualisation, presentation, and communication (emotional intelligence; I feel this is the most important one)

2. Essential Data Science Ingredients – Tools for Data Science

Companies employ data scientists to help them gain insights about the market and improve their products. Data science calls for several tools and skills: big data technologies, UNIX, machine learning, Python, R, SQL, and more. You can start with the last three (Python, R, SQL) right now. They are the most widely used today, and learning them will pay off in the future.

3. Become a jack of all trades and master of none, for the points BELOW only

a. Develop the mind, discover new things and enhance your imagination: in short, start reading.
These benefits are reason enough to start reading NOW! Also, make sure that you read outside your discipline. It will expose you to a wide range of techniques and problems and help you get comfortable jumping feet-first into a new topic.

b. Try to analyze new types of Data
Data can come in any form. Text and images are relatively easy to interpret, so also try interpreting data in video or audio. Data can also come from a pre-trained model, a relational database, or a time series. Those last three might be tough for you as of now, so you can start with video and audio.
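When you feel ready for the tougher forms, here is a minimal sketch of what working with a relational database and a time series can look like, using only Python's standard library. The table name, column names, and readings are made up for illustration; real data will need more care (missing days, time zones, and so on).

```python
import sqlite3

# Hypothetical daily readings; table and column names are invented for this sketch.
rows = [
    ("2024-01-01", 10.0),
    ("2024-01-02", 12.0),
    ("2024-01-03", 11.0),
    ("2024-01-04", 15.0),
    ("2024-01-05", 14.0),
]

# Store the readings in an in-memory relational database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (day TEXT PRIMARY KEY, value REAL)")
conn.executemany("INSERT INTO readings VALUES (?, ?)", rows)

# Pull the series back out in time order using SQL.
series = [value for (value,) in conn.execute("SELECT value FROM readings ORDER BY day")]
conn.close()


def moving_average(values, window):
    """Return the mean of each consecutive slice of `window` values."""
    return [
        sum(values[i : i + window]) / window
        for i in range(len(values) - window + 1)
    ]


# Smooth the series with a 3-day moving average to see the trend.
smoothed = moving_average(series, window=3)
print(smoothed)
```

The SQL query handles storage and ordering, and the moving average is one of the simplest ways to look at a trend in a time series.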

c. Get Inspired with New Ideas and Be Impacted: Talk to new people
Create a network of people you can learn from. If someone further along the road can mentor you in the right manner, it can add direction to your career. Talk to people with a technical background who are outside your field. You will pick up new information, ideas, and gaps where you can create value.

Also, talking to people with a non-technical background will help you improve your soft skills. You will get a chance to explain the technical details of your own academic background to them.

d. You can now cry over spilt milk with Version Control
Imagine having control over your actions, so that with a single click they could be undone. Wouldn’t life be perfect then?

I don’t know about real life, but you can have that control over your code. If a mistake is made, you can turn back the clock and compare earlier versions of the code to help fix it while minimizing disruption to all team members.

Version control lets you see exactly what you changed and what you broke. It is also very good for individual projects, and practicing with it regularly will help you master it.
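To make "compare earlier versions of the code" concrete, here is a small sketch that uses Python's standard `difflib` module to show what changed between two versions of a function. This is not a version control system, only an illustration of the kind of comparison one gives you; the function and the file labels are hypothetical.

```python
import difflib

# Two hypothetical versions of the same small function.
old_version = """def mean(values):
    return sum(values) / len(values)
"""

new_version = """def mean(values):
    if not values:
        return 0.0
    return sum(values) / len(values)
"""

# Build a unified diff: lines starting with "+" were added, "-" were removed.
diff_lines = list(
    difflib.unified_diff(
        old_version.splitlines(),
        new_version.splitlines(),
        fromfile="mean.py (earlier version)",
        tofile="mean.py (current version)",
        lineterm="",
    )
)
print("\n".join(diff_lines))
```

A real version control tool keeps the full history of such changes, so you can also go back to any earlier version, not just compare two of them.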

4. Never settle

Here I am not talking about the beautifully designed phone with premium build quality that brings the best technology to users around the world. I am asking you not to stop at “Good Enough”.

This means that if a model is not accurate and needs additional tuning, it should not be left at the “good enough” stage. This will be a major factor that differentiates you from other data scientists. Aim for perfection in the task and make sure you answer every question you can with the data. Above all, try to add value. If someone else finds your work valuable, I bet it will be the most valuable work for you too.
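As a rough illustration of going past "good enough", the sketch below compares a default decision threshold with a tuned one on a tiny, made-up dataset. The data, the 0.5 default, and the candidate grid are all assumptions for the example, not a recommended recipe.

```python
# Toy, made-up data: (model score, true label) pairs.
samples = [
    (0.10, 0), (0.20, 0), (0.28, 0),
    (0.37, 1), (0.45, 1), (0.60, 1), (0.80, 1),
]


def accuracy(threshold):
    """Fraction of samples classified correctly when predicting 1 for score >= threshold."""
    correct = sum((score >= threshold) == bool(label) for score, label in samples)
    return correct / len(samples)


# The "good enough" choice: an untuned default threshold.
baseline = accuracy(0.5)

# The extra step: try a grid of candidate thresholds and keep the best one.
candidates = [i / 20 for i in range(1, 20)]
best_threshold = max(candidates, key=accuracy)
best = accuracy(best_threshold)

print(f"baseline accuracy: {baseline:.2f}")
print(f"tuned accuracy:    {best:.2f} at threshold {best_threshold}")
```

In practice you would tune on one part of the data and check the result on a separate held-out part, because a setting tuned and scored on the same data can look better than it really is.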

