Data Analyst portfolio
DATA ANALYST
PORTFOLIO
BY SARAH
4/13/2022
DATA ANALYST PORTFOLIO
In order to get to the root cause of a business problem, factors like data, facts, trends,
environment, management, customers and products will be spotlighted in arriving at
set goals and objectives. I have the passion, patience and the skill set to analyses the
contributing factors, generates interventions, examine solutions in order to execute
the necessary steps in solving problems.
PROFESSIONAL BACKGROUND
Xyz has over 10 year’s professional background in business
management. My Educational and career background has been
centered on business management with specialized focus on
business analysis, business intelligence and Administrative
management.
EDUCATION
CERTIFICATIONS
Entry Level
➢ Data Analyst certification (April, 2022)
➢ Google project management certificate (March, 2022).
SALFORD UNIVERSITY (Jan, 2012 to June 2013)
➢ International business
(M.Sc.)
Covenant university (Sept, 2006-July, 2010)
➢ Business Administration (B.sc)
EXPERIENCE
Lukah exchange bdc ( Jan 2014-Jan 2022)
➢ Business intelligence Analyst
Provista Consulting Ltd (September,2013-january 2014)
➢ Business operation and management.
National Car park Manchester (December, 2012-May 2013)
➢ Customer Correspondent Advisor and virtual Assistant
1
TABLE OF CONTENT
Professional background
1
Table of content
2
Udemy project describtion
3
The problem
4
Design
5
Findings
6
Findings 1
6
Figure 1. Total subscriptions by category
6
Figure 2. Users by Skill level
6
Figure 3 Avarage content duration
Findings 2.
6
7
Table 2. Sum Subscriber by Subject
7
Table 3 Average Subscriber by Subject
7
Figure 3 Average Subscriber by Subject
8
Figure 4 Sum of reviews vs. subscriber
Figure 5 Average of reviews vs. subscribers at each level
9
9
Analysis
9
Conclusion
10
Project Description
12
Problem
13
Data Design
Findings
14
15-21
Data Analysis
22
Conclusion
23
2
UDEMY PROJECT DESCRIPTION
UDEMY PROJECT DESCRIPTION
Tasked by the manager, head of curriculum at Udemy to present data
on course revenue at Udemy. Data from different courses was
collected in order to know the following
➢ determine opportunities for revenue increase
➢ Track the performance of courses offered.
The manager gave duration of three weeks for analysis and report
preparing for onward submission to the CEO on how they will increase
their quarterly earning
In order to arrive at an optimal conclusion, the manager asked to
focus on the top ten courses offered at udemy for analysis and
presentation of report. The top twenty courses were thoroughly analyzed
to understand which courses are subscribed the most and why were they
subscribed, in determining the whys, I wanted to know how many
courses were free/not free, content of the course as well as the number of
reviews to determine its popularity.
Analyses were performed using pivot tables and tableau for
virtualization of data to give a clear picture of trends.
In conclusion, results from analysis clearly shows that web
development was subscribed to the most, the categories of subscribers of
web development were grouped into paid and free (no payment), their
levels (beginners, intermediate and advanced courses) these categories
top the charts in both pivot tables and tableau.
3
THE PROBLEM
The business is looking for opportunities to increase
revenue as well as track the performance of courses that are
offered in Udemy.
The project was given a three week time frame to present
the findings and analysis to the manager, head of curriculum.
To understand these problems data on all Courses offered in
udemy was collected from course file, attention was placed on
collecting all the course data on, duration of courses, content of
courses, review made by people on courses, given ratings and
price paid for courses such as web development, music,
business and design was gathered.
For better understanding of the business problem, questions
were asked to get to the root course. Question such as total
subscribers for individual courses, averages of ratings and
reviews received and lastly how many courses were offered free
and how many was paid for. Information gathered was used to
present an overview of trends and fact so that decisions are
made factually.
4
DATA DESIGN
Data collected had to be imported from different file location as a result
cleaning and alignment had to be done;
➢ Empty/blank roles were detected and removed using Google sheet.
➢ Column for web development was not properly presented so it had
to be rewritten using ‘find and replace’ in Google sheet.
➢ Duplicates were detected and removed via data on Google sheet.
Google sheet was used in cleaning and presenting data on the total
number of subscribers and their averages.
TABLEAU
Tableau was used to create reports and dashboard for visualization
and analysis. I used tableau to support my findings after analysis
with the pivot tables.
5
FINDINGS
This section captures the findings from the analysis conducted to seek
out opportunities to increase revenue and track the overall course
performance.
Figure 1.
Total number of subscribers per subject
SUM of num_subscribers-
-
Business Finance
-
Graphic Design
846689
Musical
Instruments
Web
Development
Figure 1 clearly show that web development has the highest number of
subscriber.
Figure 2.
Total number of rating by users vs. their skill level-
level All Levels
level Beginner Level
level Expert Level
level Intermediate
Level
level Grand Total
Figure 2 clearly shows that graphic design (subjects) had more review;
the highest concentration was on expert level of graphic design.
6
Figure 3 Average content duration vs. subjects offered
subject
AVERAGE of content_duration vs
subject
Business Finance
AVERAGE of
content_duration
Graphic Design
Musical Instruments
Web Development
0
2
4
6
AVERAGE of content_duration
Findings 2
Table 2
Average number of subscribers by subject
subject
AVERAGE of
num_subscribers
Business Finance
-
Graphic Design
Musical
Instruments
Web
Development
-
Grand Total
-
-
Table 3
Sum of reviews by subject
subject
SUM of num_reviews
Business Finance
75902
Graphic Design
37070
7
Musical
Instruments
31724
Web Development
429899
Grand Total
574595
Figure 3 SUM OF REVIEWS VS SUBCRIBERS
Figure 3 shows the sum of reviews vs. subscribers of courses that are
offered freely and those that are paid for.
8
Figure 4: AVERAGE NUMBER OF REVIEWS VS SUBSCRIBERS AT
EACH LEVEL
9
DATA ANALYSIS
The manager, head of curriculum wants to know opportunities for
increased revenue in the second quarter and course performances.
Why course performance analysis?
Courses performance analysis was carried to know which courses are
subscribed to the most, figure 1 clearly shows that web development has
the highest number of subscribers.
Why know the number of subscribers?
In determining the total number of subscribers to the courses, it was
discovered that students who enrolled for the various courses, did so
because of some factors such as content, reviews, rating and some
courses were paid for and some were offered free.it is clearly shown that
the highest course subscribed to which was web development was
offered free.
Why was web development more subscribed than the other
courses?
Web development had the highest number of content and most of them
were offered free at all levels, but it came second when it came down to
the ratings. Graphic design had the highest rating at the expert level.
Why know why students are enrolling for the courses
Students were enrolling for web development courses because most were
free at all level, had more content and reviews.
10
CONCLUSION
After careful analysis here are some suggested actionable considerations
➢ Graphic design had the highest ratings of all subject but had a
lower course content in comparison to web development, that
means there is an opportunity to increase revenue
➢ Web development course are in demand therefore Udemy can take
advantage of its marketability because of its high content rating,
reviews and subscription.
11
PROJECT DESCRIPTION
XYZ SALES DATA BY PRODUCT LINE
The company is looking to expand its capacity to deliver more trains
to its customers after recently partnering with train go ltd, a major
manufacturing and assembly train company located in America. The
company has two months to sign its agreement with train go ltd, Xyz
company wants to know its current capacity and offerings based on its
customers, their countries and their deal size in order to leverage on its
newly acquired partnership.
Xyz operations manager wants to know the following;
➢
➢
➢
➢
Its major customers/clients for trains
Its deal size
The rate of shipping
The relation between sales and quantity ordered.
The time given for the completion of analyses was four weeks. In
executing this task, sales data was collected, covering the customers,
their countries, deal size, product lines, product line that was shipped,
cancelled and those placed on hold.
Analyses were performed using pivot tables and tableau for
virtualization of data to give a clear picture of trends.
In conclusion, after analysis has been carried out, results shown that out
of all the product line such as classic cars,motorcycles,planes,ships,
trucks and buses and vintage cars, trains was lower in demand.
12
THE PROBLEM
The XYZ Company is looking to boost its capacity by
partnering with train go ltd a manufacturing and assemble
Train Company. The company wants to know if the partnership
is profitable and viable to its operating success.
The project was given a month time frame to present the
findings and analysis to the manager, head of operations.
To understand these problems data on all sales, product line,
deal size, status, quantity ordered, list of customers and their
individual countries was gathered.
For better understanding of the business problem, questions
were asked to get to the root course. Question such as what is
the current deal sizes of the product line, which countries are
mostly ordering trains, what is product line are mostly ordered.
Information gathered was used to present an overview of trends
and fact so that decisions are made factually.
13
DATA DESIGN
Data was collected from the sales department and was cleaned by
➢ Removing duplicates
➢ Removing empty rows
Excel sheet was used in cleaning and presenting data on the total
number of subscribers and their averages.
TABLEAU
Tableau was used to create reports and dashboard for visualization
and analysis. I used tableau to support my findings after analysis
with the pivot tables.
14
FINDINGS
After data has been collected and cleaned, analysis into the data show
the followings
FINDINGS 1
FIGURE 1: SUM OF SALES OF ALL PRODUCT LINE BY COUNTRIES
Total-
-
Total
-
USA
UK
Switzerland
Sweden
Spain
Singapore
Philippines
Norway
Japan
Italy
Ireland
Germany
France
Finland
Denmark
Canada
Belgium
Austria
Australia
0
15
Figure 2 sales for all product line
FIGURE 3: COUNT OF SALES FOR TRAIN BY COUNTRIES
Trains
30
25
20
15
10
Trains
5
USA
UK
Sweden
Spain
Singapore
Norway
Japan
Italy
Ireland
Germany
France
Finland
Denmark
Belgium
Australia
0
16
Table 1 count of sales by countries
COUNTA of
SALES
COUNTRY
Australia
Belgium
Denmark
Finland
France
Germany
Ireland
Italy
Japan
Norway
Singapore
Spain
Sweden
UK
USA
Grand
Total
PRODUCTLINE
Trains
Grand Total-
-
The highest count of sales were from USA and Spain.
Findings 2
Figure 4: Sales vs. quantity ordered over time for train
Figure 5: sales vs. deal size by product line.
17
Trains had the lowest deal sizes (medium, Large and small) according to
sales.
18
Finding 3.
Figure 6: trains that was ordered and shipped by customers in each
countries.
19
Table 2: product line that was ordered and cancelled.
STATUS
SUM of
QUANTITYORDERED
COUNTRY
Spain
Sweden
UK
USA
Grand Total
Cancelled
PRODUCTLINE
Classic
Cars
Planes-
Ships
Trains
-
42
42
Vintage
Cars-
Grand
Total-
Only USA cancelled its order.
20
Figure 7: Trains that was cancelled shipped and on hold by customers.
Cancelled, On Hold and Shipped-
STATUS Shipped
150
STATUS On Hold
STATUS Cancelled
100
50
0
CUSTOMERNAME
Figure 8: sum of quantity ordered over a period of time for train-
2005
2004
USA
UK
Sweden
Spain
Singapore
Norway
Japan
Italy
Ireland
Germany
France
Finland
Denmark
Belgium
Australia
2003
21
DATA ANALYSES
Finding the root cause
➢ Why does the company want to partner with train go?
To increase its capacity and get more sales for train and increase
its deal size, from the analysis in figure 2, it clearly shows that
trains were the lowest product line ordered.
➢ Why does is train the lowest quantity ordered? It shows in
findings 2, in figure 4, the quantity ordered over time has declined
in June 2003, June 2004 and June 2005.
➢ Why did train quantity decline in the month of June? USA
cancelled its order in June 2004, also shown in the figure 8 only 4
countries ordered for train in 2005, these countries were USA,
Spain, Japan and France
➢ Why was there decline?
Most clients who ordered for trains transacted on a small and
medium sale deal, only France transacted on a large scale
➢ Why do most customers and client demand for it on a small and
medium scale?
This is because the company is yet to expand its capacity
22
CONCLUSIONS
OBSERVATIONS AND ACTIONABLES
➢ More investigation can be conducted to know why USA cancelled
its order
➢ Partnership with train go ltd can be viable since most quantity
ordered are rarely cancelled and put on hold.
23
APPENDIX
➢ Google sheet data set for Udemy project
https://docs.google.com/spreadsheets/d/1_3d_1KrS0l
w39DmZIqiN6pOTeCgXGZTF2iSAsFZL7JE/edit?usp=s
haring
➢ Tableau visualization for Udemy project
https://public.tableau.com/views/numberofreviewsvssu
cribers/Sheet4?:language=enUS&:display_count=n&:origin=viz_share_link
➢ Google sheet for sales data by product line
https://docs.google.com/spreadsheets/d/1dJF8-94HelcU2Phud57MYJnrSCfnxOjr26DQFGXPIE/edit?usp=s
haring
➢ Tableau visualization for sales data
https://public.tableau.com/views/salesvsquantityordere
doveraperiodoftime/Sheet5?:language=enUS&:display_count=n&:origin=viz_share_link
24
25