4 Things You Should Know For A Career In Data Analytics

4 Things You Should Know For A Career In Data Analytics

Questioning and answering. At times you may feel that you have found the correct answer. I assure you that this is a total delusion on your part. You will never find the correct, absolute, and final answer. — Professor Kingsfield, The Paper Chase

All kinds of organizations today are hiring data analysts to make sense of the growing amount and range of data they generate and collect. Wringing actionable answers out of data has become a key business skill. All kinds of organizations collect big data and want to use it to make or improve decisions. Firms in fields as varied at B2B and B2C commerce, health care, manufacturing, and marketing all use data analytics to improve processes and enhance profits.

For example, "Medicine uses data analytics in clinical studies to predict the efficacy of medicines and survival rates," says Carl Howe, director of education at RStudio, a company that provides open source and enterprise tools for use with the R programming language. "Factories are always looking to improve production yields — if you can improve yield by one to two percent, that can mean millions of dollars to a chip or drug manufacturer."

And while companies are working on automating data analytics, "around 80% of the job hasn't been automated, and the 20% that is being automated still isn't automated really well," says Matthew May, lead data scientist at URSA. "More importantly, any problem that auto-machine learning can solve is a 'softball problem.' Hard problems take one or more people to work on. So jobs doing data analytics aren't going away."

Data science, says Howe, "is about 'can we model the world — and use these models to make predictions,' while data analytics is more about extracting insights from big datasets. For example, 'we forecast the demand for oil will peak by 2030,' or 'the proportion of the world living in poverty has almost halved in the last 20 years' are both results from data analysis."

Salarywise, a Northeastern University report that cites Robert Half Technology's 2019 Salary Guide says, "Data analysts have an earning potential of between $81,750 and $138,000... Since these professionals work mainly in databases, however, they are able to increase their salaries by learning additional programming skills, such as R and Python."

What skills and experience do you need to succeed as a data analyst? To find out, we got advice from four data analytics professionals.

  • Carl Howe is a data scientist and director of education for RStudio, Inc., which provides open source and enterprise tools for use with the R programming language.
  • Dr. Kristen Sosulski is a professor of tech operations and statistics at NYU Stern and author of the book Data Visualization Made Simple: Insights Into Becoming Visual.
  • Matthew May is lead data scientist at URSA Inc. (for Unmanned Robotics Systems Analysis), an unmanned systems data analytics and visualization company that helps users unlock value from their telemetry data.
  • Dr. Rosaria Silipo is the principal data scientist at KNIME, which provides the KNIME Analytics Platform tool for data science. She's also the author of six books, plus dozens of other technical publications.
No alt text provided for this image

1. You have to like working with numbers

"Doing data analytics makes use of two skills," Howe says: "One, statistics, and two, telling a story with those statistics in ordinary words.

"If you're going to be a data analyst, you must know how to use statistical techniques accurately. You have to like and be good at working with numbers. You have to be able to see data like a mystery or puzzle, and think, 'There's something in here that I want to discover.' Then you apply your math skills to find clues and eventually solve that mystery."

But that's only half the story. Jobs in data analytics focus not only on the numbers but also on how we communicate insight, Howe says. "You're turning data and statistics into a story that can influence others. That story probably has to be told in pictures, because that's the way we internalize information quickly."

Conveying the meaning of results in a way that can be quickly and easily grasped is essential. You have to be very descriptive in the work you do, be able to visually communicate what you've learned from your analysis — for example, creating charts and graphs, and solidly interpreting them, using the data as evidence.

In general, says May, "You have to be curious and inquisitive, and enjoy not knowing how to solve a problem, not knowing the answer, and working through that to get to a usable solution or usable actionable answer."

Beyond this, May says, "There are varying levels of technical ability. Statistics helps. So does having a behavioral analysis background." This is why i recommend starting your Data Science journey with Quantum Analytics NG, because this is included in their training plan.

Behavioral analysis "is concerned with describing, understanding, predicting, and changing behavior." That, May says, "is a big part of data science/analytics. Often we are looking to be able to influence user behavior, i.e., ‘get them to click on this, or that’ or answer the question ‘why did the user click on this, or that?’ So having a background in a science that is all about behavior can be very useful."

2. You have to know how to code, but you don't have to know computer science

Along with a love for numbers, data analysts need an affinity for working with them programmatically.

"You should learn to code, for reproducibility so others can build on what you've done," Howe says. If you can't write down a program that does what you are doing, "you're left with two choices: teach others how to do it or keep doing it yourself forever."

What computer languages and other software tools are most likely to be useful for a data analyst? SQL is essential — it's the standard language for data manipulation. Other useful options:

  • Python. "Pick a graphing library in Python and get to where you're pretty good at it," May says, "and learn Pandas, the Python Data Analysis library."
  • R, a free software environment for statistical computing and graphics. "R is written specifically for data analytics and science," RStudio's Howe says. However, URSA's May says if you don't already know R, it may not be worth your time to learn it. "If you mostly deliver one-off answers that don't get put into production, R is OK, but If you're shipping code to production, R gets really tricky."
  • Hadoop, a collection of tools for processing large datasets, and Spark, a fast and general cluster computing system for big data. "I think anyone who gets hired as a data analyst and can't wrangle and clean data will struggle in a real-world environment because at least 90% of an analyst's work is cleaning and transforming data. If your data set is large enough that you can't process it on your laptop, you need big data (and usually cloud-based big data) skills such as Hadoop and Spark," says Howe.
  • Cloud-native and desktop analytics platforms, such as Looker, Tableau, and Microsoft Power BI. However, notes Carl Howe, "Many of those tools are simply cloud-based versions of point-and-click visualization tools, which rely on manual and irreproducible processes for analyzing data. If you're an analyst who knows how to use a programming language, you'll have no trouble picking up those tools if you need them. On the other hand, if your skills are primarily in the point-and-click world, you'll find it difficult to make the transition to a code-based analysis environment, which is where hardcore data analysts work." We recommend training with Quantum Analytics NG for a strong foundation in programing.
  • Excel. Howe says, "Data scientists and analysts love to disparage Excel, but the reality is that many businesses run on Excel data. A good data analyst can build a dialog with end users and find ways to work with those users, and in many businesses, that may mean working with Excel data.

That said, while knowing how to code and knowing a programming language or three is essential to being a data analyst, coding for data analytics doesn't require the same depth of knowledge required for a degree in computer science.

"Data analytics and computer science are different disciplines," Howe says. "Data analytics is more about understanding large datasets. In a computer science course, you'd be introduced to the concept of loops and loop statements, but in data analytics, you might not encounter this concept until the end, because data analytics operations process a whole set at once; looping is only used rarely. So while a data analyst needs to be able to write code, they don't necessarily need a computer science background."

No alt text provided for this image

3. Communication skills are (almost) as important as math

You may have the technical chops to handle data analytics, but that might not be enough to get hired. What else do you need to ace an interview?

"One, make sure you can talk your way through a number of machine learning algorithms," says URSA's May. "Two, be able to speak to the bias-variance tradeoff in prediction models — and what you can do to/about it using SQL. And three, be able to talk through an end-to-end data science or data analytics problem that you've solved — what the problem was, your solution, and how you dealt with the roadblocks you encountered along the way."

Silipo says, "I look for many different things when I run interviews for these people and positions. First of all I look for technical skills. I give them an exercise and see how they approach it, how their way of thinking is, and whether they have the right math background. This applies to both data analytics and data science.

"Then I check their communication skills. It's true that a data analyst’s role, like that of a data scientist, is mainly technical, but for both roles, a minimum level of communication will be required to explain the results of a project or even to promote the project itself."

Communication skills covers a range of factors. Do you have a design sense, to create visualizations? Can you communicate with non-technical colleagues?

"And then finally, and most importantly, I check their attitude," says Silipo. Data science is constantly evolving and there will be new concepts and new algorithms to learn every year. A curious attitude is what I need. I need somebody who isn't afraid of saying, 'I don't know. I'll research it.' It's impossible to know everything in the data science space, so a healthy humble attitude mixed with a self-starting curious attitude is the right combination."

But attitude only goes so far. "People want to see demonstrable evidence of your skills," NYU's Sosulski says. "How do you do this? Build portfolios of data projects from start to finish."

It's good to have one — and it shouldn't just be code that you've written. You should have write-ups to accompany your code, using words to explain — in a concise way — what you did. Code can get pretty long, and can take a while to digest if you don't comment it correctly. And even then, nobody has hours to dig through your code. So you have to be able to explain what you did.

Your portfolio should have at least two classification problems where you use different algorithms, and two regression problems where you use different algorithms. And all of these problems need to have the proper data science workflow.

In terms of the datasets you use in your portfolio, "Use some nice clean datasets — but if you can, get your hands on at least one that's very dirty and raw, so you can show what you did with missing values — do you fill them in or remove them." And how can you find data and projects to work on? Many open source datasets are available online.

4. Much of what you'll do won't be at the top of the job description

You may have an intuitive idea about what a data analyst does, but what you imagine might not line up with how you actually spend your time. URSA's May says, "Mostly, you'll be thinking about a problem or question, and how you can use data to potentially solve or answer that. And you'll be doing EDA — exploratory data analysis — which means seeing if you can find a signal that can help answer that question or problem."

Some of this may get done on a blackboard (traditional or digital), and some with coding, May says. "You do EDA by writing code. There's also writing the code for the model that you build, and the code to create the graph to show your answer, and the code for the statistic you're going to derive."

Once you think you know how to answer a question, you still have a ways to go before you create a report or a visualization. "One irony of both data science and analytics is that while you need to know a great deal about models and machine learning, you will usually spend as much as 90% of your time cleaning real-world data before you analyze it. It's the old story of 'garbage in, garbage out,'" says RStudio's Howe. "You need clean data to work with before you can model it."

Cleaning the data, Howe says, includes:

  • finding things that are clearly errors, are coded badly, or exhibit transcription problems
  • transforming data into a consistent and meaningful format. Times and dates, for example, may be represented in many different ways with many different reference points, "so if you have a dataset of measurements that were recorded at local times around the world, all of those have to be converted to a standard such as Greenwich Mean Time before you can analyze them."
  • Deciding what to do with values that are left out, like temperature measurements with a faulty sensor. "Do you leave them out of your analysis?" says Howe. "Extrapolate from other nearby values? Replace them with the average value? There is no right answer — all will introduce bias into the results."
No alt text provided for this image

Conclusion: So you want a job in analytics

Learning the skills you need to get a job in data analytics won't happen in a month — depending on your background and goals, expect to spend some time or more.

But if you have a mathematical mindset and no fear of coding, becoming a data analyst should be within your reach. To help you on your way, reach out to Quantum Analytics NG for learning and doing data analytics.

To begin your data analytics journey as a certified Data Analyst follow Quantum Analytics NG on LinkedIn & signup to their training programs Here.

Follow Quantum Analytics NG on Twitter

____________________

Jonathan Osagie is a Data & Business Intelligence Analyst interested in understanding stories that data tells, actions it inspires through analysis & analytics, and the success it brings to businesses by aiding strategic decisions.

Follow me on Twitter.

#BecomeGloballyInDemand

To view or add a comment, sign in

Insights from the community

Others also viewed

Explore topics