Skip to main content

Myth about Data Science - A must know for all Data Science enthusiast


1. Only Coder /Programmer can only become a Data Science

No, its not correct. People who is having Basic Programming skills like Python/R or atleast who can learn basic programming can come in to this field.Here i wanted to suggest people who is having Engineering background /Software they can choose Python as a programming and The person who wanted to transit their career in to data science field but coming from non Engineering background like Arts,Commerce,Science they can prefer R as a Programming language . Here am not saying for non technical background can not learn python , its bit difficult to understand the basic and algorithm but if they are ready to learn no issues, they can take any of these either Python or R., I have Mentioned while choosing any of these which one is good according to me in another article i.e python, you can refer my article to get better understanding.

2. Data Scientist are master of all technology .

No, fact is that you should have knowledge on basic data science skills , you dont required to be expertise. A famous line " Jack of all trades " we should have knowledge or basic idea about terminology , algorithm how to apply on the data , basic statistical skill , no one would master in this Era, specially when technology is keep on growing. So i can say you must familiar about the terminilogy and logic and their uses and main thing how to apply these in our data.

3. Data Science is all about tools and technology.

No , its not . Data science is not just about tools and technology, because just applying some lines of code and executing the algorithm and getting the good accuracy is not data science, you should know how to interpret the algoithm and understanding the algorithm /Choosing the algoirthm which is best for the particular problems which is correct or not.

4. Data Analyst and Data Science both works same:


“A data scientist is someone who can predict the future based on past patterns whereas a data analyst is someone who merely curates meaningful insights from data.”

“A data scientist job roles involves estimating the unknown whilst a data analyst job roles involves looking at the known from new perspectives.”

“A data scientist is expected to generate their own questions while a data analyst finds answers to a given set of questions from data.”

“A data analyst addresses business problems but a data scientist not just addresses business problems but picks up those problems that will have the most business value once solved.”

“Data analysts are the one who do the day-to-day analysis stuff but data scientists have the what ifs.”

This is what Abraham Cabangbang, Senior Data Scientist at LinkedIn commented on the difference between data analyst and data scientist -

“It’s definitely a gray area. At my previous company I did both analyst and Data scientist jobs and as an analyst we were more customer facing; the tasks we did were directly related to the tangible business needs—what the customers wanted/requested. It was very directed. The scientist role is a little more free form. The first thing I did as a data scientist is work on building out internal dashboards, basically surfacing information that we were tracking on the back end, but weren’t being used by the data analysts for any reasons; for example, we might have lacked the infrastructure to display it, or the data was just not very well processed. It really wasn’t anything tailored out from a customer need, but came from what I noticed the analyst team needed in order to do their job.”.

Comments

Popular posts from this blog

Deep Learning Interview Questions - Part 1

Q1. What do you mean by Deep Learning?  Deep Learning  is nothing but a paradigm of machine learning which has shown incredible promise in recent years. This is because of the fact that Deep Learning shows a great analogy with the functioning of the human brain. Q2. What is the difference between machine learning and deep learning? Machine learning is a field of computer science that gives computers the ability to learn without being explicitly programmed. Machine learning can be categorised in the following three categories. Supervised machine learning, Unsupervised machine learning, Reinforcement learning Deep Learning is a subfield of machine learning concerned with algorithms inspired by the structure and function of the brain called artificial neural networks. Q3. What, in your opinion, is the reason for the popularity of Deep Learning in recent times? Now although Deep Learning has been around for many years, the major breakthroughs from these te...

How to deal with missing values in data cleaning

The data you inherit for analysis will come from multiple sources and would have been pulled adhoc. So this data will not be immediately ready for you to run any kind of model on. One of the most common issues you will have to deal with is missing values in the dataset. There are many reasons why values might be missing - intentional, user did not fill up, online forms broken, accidentally deleted, legacy issues etc.  Either way you will need to fix this problem. There are 3 ways to do this - either you will ignore the missing values, delete the missing value rows or fill the missing values with an approximation. Its easiest to just drop the missing observations but you need to very careful before you do that, because the absence of a value might actually be conveying some information about the data pattern. If you decide to drop missing values : df_no_missing = df.dropna() will drop any rows with any value missing. Even if some values are available in a row it will still get dropp...

Mathematics & Statistics for Data Science

Converting raw and quantitative data into organized and informative information needs a lot of brain power and understanding. It is true that everyone can’t be Aryabhatta but, you can be hardworking, focused and dedicated. So, it is time to show your dedication and hard work for learning maths and statistics for data science. Mathematics and Statistics are two of the most important concepts of Data Science. Data Science revolves around these two fields and draws their concepts to operate on the data. Today, we will explore the various concepts that build up data science and their practical usages in this field.Data Science has become a trending technology in the world today. In order to learn data science, you must reinforce your knowledge of mathematics and statistics. So let’s first explore how much maths is required for data science – Math for Data Science Mathematics has created an impact on every discipline. The magnitude of the usage of mathematics varies according to the ...