
Portfolio

Data Manipulation with Python

Can't hear the video or have a hearing disability? Here is the transcript of the video:

Hi there, welcome to my latest project! Here I was working with ASDRP. They are a non-profit educational institution that works with high school students to do some real research with an advisor in all kinds of fields: biology, astronomy, quantum mechanics, you name it. I was working on their Research Directory, this spreadsheet over here, which contains information on all the research that they have done: the advisors, the names of the students, and a little abstract on the research that they are doing. As you can hear, it contains a lot of private data. So I made this script over here that generates some fake information; that way I can visually show you what I did on the original spreadsheet.

As you can see, the names and the emails of minors, the high school students, are all mixed together in the same cell. This one was generated by the script, so it is somewhat uniform, but in the real one some of the entries, for example, did not have the numbers, or maybe they had a lot of extra spaces in between, or some of them didn't have the commas, and so on. There were a lot of little things like that, and in the long run it made things very difficult, since we had a lot of cells and a lot of rows, and we had multiple spreadsheets as well.

So what I did is I made this little script that goes to the "Authors" column, which is the one where the students are listed, and it separates the names from the emails so you can have them all clean and neat over here, and that saves us a lot of time!
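To make the idea concrete, here is a minimal sketch of how a messy "Authors" cell could be split into names and emails. It is not the script from the video: the function name `split_authors`, the email pattern, and the sample cell (made-up names and addresses) are all assumptions chosen for illustration. It relies on one observation from the transcript: numbering, commas, and spacing are inconsistent, so the email addresses are the only reliable anchors.

```python
import re

# Assumed email pattern: deliberately simple, not a full address validator.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def split_authors(cell: str) -> list[tuple[str, str]]:
    """Split one messy "Authors" cell into (name, email) pairs.

    Hypothetical helper: the text found between two consecutive
    emails is treated as the name belonging to the second email.
    """
    pairs = []
    start = 0
    for match in EMAIL_RE.finditer(cell):
        raw_name = cell[start:match.start()]
        # Drop optional leading list numbering such as "1." or "2)".
        name = re.sub(r"^[\s,;]*\d+\s*[.)]?", " ", raw_name)
        # Drop stray separators, then collapse runs of whitespace.
        name = re.sub(r"[,;:()<>]", " ", name)
        name = " ".join(name.split())
        pairs.append((name, match.group()))
        start = match.end()
    return pairs


# Fake sample data: numbering, commas, and spacing vary on purpose.
sample = (
    "1. Jane Doe, jane.doe@example.com "
    "2.John   Smith john_smith@example.org, "
    "Ana Lima ,ana@example.net"
)
for name, email in split_authors(sample):
    print(f"{name} | {email}")
```

Each pair can then be written to separate "name" and "email" columns. Anchoring on the emails rather than on commas or numbers is a design choice that tolerates the inconsistencies described above, at the cost of skipping any student whose email is missing from the cell.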

DATA DATA DATA

While working on the Google Advanced Data Analytics Certification, I worked through every single part of the data analysis process across three companies. I only had to choose one, but I wanted to make sure I learned. There are around 6 reports on each!

 

The reports go from the Project Proposal, through Exploratory Data Analysis, Tableau, A/B Testing, and Multiple Linear Regression, to building the Machine Learning Model and finding a way to easily communicate it to a stakeholder. Here is how I did it; click the icons to see my reports!


Python Pokemon Videogame

Did I mention my programming background? My first contact with Excel was as a kid and, even though it might look like it, I did not fall in love with the monotonous cells and think: I should be a Data Analyst!

No, it started with Python. I was first introduced to Python as a victim of my own curiosity and quickly fell in love with the deterministic nature of coding. Yet it didn't feel like my thing, until I discovered this same tool could turn millions and millions of individual data points into something meaningful, beautiful, and well-grounded, like a scenic hill covered by flowers.


Full Data Analysis process in SQL, Excel and R

To earn my Google Data Analyst Professional Certificate, I completed a case study for the fictional company Cyclistic, which rents bicycles in urban cities. In general terms, it addresses the question: how can we turn casual riders into affiliate members?

 

This project is special to me because it was the first time I was able to do data analysis from start to finish: from data collection, through cleaning and merging, to getting insights and finally presenting my work.

 

Since you have already seen some of my SQL and Excel skills, you can check the full, detailed report in R:

Urban Biking