Week 9 Report

We are officially in the second half of the semester, meaning we can pick a new topic. In week 7’s report, I believe, I mentioned possible topics that I would like to look at.


Update: I was referring to the content in the post “10/19 well” about a new topic. I think I would like to look at the “Movie Revenue and Rating Prediction” data from this site: https://www.interviewquery.com/p/regression-datasets-and-projects


Happy Halloween!!!

Today I am looking at the TMDB 5000 Movie Dataset. I had read through the page a little last class.

This comes from the page itself. I thought “The Host” being the most common was interesting. I am unsure whether this means it is the title that repeats the most in the list (there are only 3 repeats) or that it is the movie they predicted would make the most.

Dr. Davis suggested finding out how they predicted things, to get a better starting point or an answerable question for later in the semester when report time comes again.

Week 7 Report

10/13/15 Indigenous Peoples’ Day

No class.


Exactly a week until the report is due. This is unsettling. I am uncertain of my starting point. I’ve yapped a lot, but I don’t have one main question to focus on. Dr. Davis suggested looking at what’s here so far and picking a focus.

Nicole mentioned working together for the paper so I have just spent some time reading through her posts. I appreciate all her graphs and general organization of data. It feels easy to follow and I can see a way to bring our information together in a sense.

I think we should focus on mental health and fleeing. Nicole has already made some graphs related to the two, and I had started looking at the fleeing status myself previously. I think our question should presumably involve both, but I need to think about it further.


 

Week 6 Report

The mid-semester report is due exactly two weeks from today. I made a reminder for today to get a progress check. As the paper is due on Wednesday the 22nd, I thought it would be good to review the requirements for the report. It feels similar to a lab report in its structure and the parts that are required.

I have emailed Dr. Davis about reading the “weekend work” posts, as I wonder how to proceed from where I am. I believe there is a possible correlation between the events, but I am unsure how to investigate this more deeply.


I have decided to spend more time over this coming weekend to better understand last week’s findings.

Week 5 Report

Happy October! Today is October 1st. The mid-semester report is due in 3 weeks and the semester is progressing steadily. I feel as though I have not made thorough progress, as I don’t have a background in programming. Today’s class was focused on everyone sharing their questions and their progress in answering them. When Erika shared, she said that she had a rather specific question and kept breaking it down into smaller, more manageable questions. I like this approach, as it can get into the “nitty gritty” of things without being the most “complex” approach. I would like to apply this idea of breaking things into manageable chunks to my own analysis.


The areas that I am most interested in searching are the threat_type, flee_status, armed_with, and race categories. I believe these four could overlap and connect in ways that can be analyzed more deeply. I’d like to use R and RStudio to make the connections and see the possible areas of overlap.

I think starting with a breakdown of each category would be good with the following questions:

-Given the threat types, what is the majority of the population classified as?

-Given the flee status, what is the predominant response?

-Given the types of things people were armed with, what is the most common weapon, if any?

I don’t have a specific question pertaining to race because, as mentioned in class, it should be scaled, since the populations of different races vary. This will be a later question rather than a starting point.
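The three starter questions above all come down to counting how often each value shows up in a column. Here is a minimal sketch of that idea, written in Python only so it runs without any setup (the same counting can be done in R). The column names come from the dataset; every value in `rows` is an invented placeholder, not real data, and `most_common_value` is a hypothetical helper name.

```python
from collections import Counter

# Placeholder rows: column names are from the dataset, values are invented.
rows = [
    {"threat_type": "attack", "flee_status": "not", "armed_with": "gun"},
    {"threat_type": "attack", "flee_status": "car", "armed_with": "knife"},
    {"threat_type": "threat", "flee_status": "not", "armed_with": "gun"},
    {"threat_type": "attack", "flee_status": "not", "armed_with": "unarmed"},
    {"threat_type": "move", "flee_status": "foot", "armed_with": "gun"},
]


def most_common_value(rows, column):
    """Return (value, count, share of all rows) for the most frequent value."""
    counts = Counter(row[column] for row in rows)
    value, count = counts.most_common(1)[0]
    return value, count, count / len(rows)


for column in ("threat_type", "flee_status", "armed_with"):
    value, count, share = most_common_value(rows, column)
    print(f"{column}: {value} ({count} of {len(rows)}, {share:.0%})")
```

Reporting the share alongside the raw count is the same kind of scaling that would later be needed for the race question.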


On Friday, I got to really start using R and RStudio. I had both downloaded, but had not opened RStudio, so I was unable to get the data open initially. I really appreciate the assistance and the start on breaking down the data.

Week 4 Report

Another Monday! Today in class we got to see some of what people have been working on and finding. Nicole presented about her findings.

Wednesday’s class was supposed to be focused on R and RStudio. Using the introduction to R page we were supposed to try playing around in the program.

Today starts with spurious correlations. We are looking at various graphs that have two variables that are seemingly entirely unrelated, but whose data look similar when plotted on a single graph. Tyler Vigen’s spurious correlations page uses AI a lot to write papers based on the data it presents.

Week 3 Report

Starting off with a webpage update since not everyone had their latest post showing first. Then onto the “Software application notes” from page 18. It has an introduction for various programs that we can use to look at and analyze data in this course. “The easiest way to learn is to start by doing it.”

In Wednesday’s class we were supposed to start using Mathematica, but a lot of the computers don’t have it working because of the wifi or other issues. I was able to download it to my laptop, though, so I will be trying to get things started on my end. I have also downloaded R and RStudio to my laptop. I believe one program was more user friendly while the other had more capabilities. As I have never used either before, I am going in with no prior knowledge of either.

The plan for Friday’s class was to use Mathematica to go over Samuel’s question about body cams. As we currently don’t have access because the student subscription has expired, the plan has been adjusted. Instead, we are looking at the data from the table and doing a t-test to compare the means. I would like to start using R and RStudio to see how well they can help in understanding the data better.
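To make the t-test idea concrete, here is a rough sketch of how the test statistic for comparing two means can be computed by hand. Which version of the t-test was used in class is not recorded here, so this assumes Welch's version (the one that does not require equal variances, and the default in R's `t.test`). The two samples are invented numbers, not the body cam data, and `welch_t` is a hypothetical helper name.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return (t statistic, approximate degrees of freedom) for Welch's t-test."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Sample variance of each group divided by its size.
    se_a = variance(sample_a) / n_a
    se_b = variance(sample_b) / n_b
    t = (mean(sample_a) - mean(sample_b)) / sqrt(se_a + se_b)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (se_a + se_b) ** 2 / (se_a ** 2 / (n_a - 1) + se_b ** 2 / (n_b - 1))
    return t, df


# Invented example values, only to show the calculation.
group_a = [1, 2, 3, 4, 5]
group_b = [2, 4, 6, 8, 10]

t_stat, dof = welch_t(group_a, group_b)
print(f"t = {t_stat:.3f}, df = {dof:.2f}")
```

A t value far from zero suggests the two means differ by more than chance alone would explain. Turning t and df into a p-value needs the t distribution, which R's `t.test` handles automatically.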

Week 2 Report

On Monday, we talked about histograms and how to read data from them. The y-axis on the one we looked at was the percentage of ages in each “bucket,” as Dr. D put it. Using percentages makes the data easy to compare with a second histogram. Whether there is a lot of overlap or very little, either is easy to see.
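Here is a small sketch of why a percentage y-axis helps: two groups of different sizes end up on the same 0 to 100 scale. The ages are invented, the 10-year bucket width is an assumption (the width used in class is not stated), and `percent_histogram` is a hypothetical helper name.

```python
def percent_histogram(ages, bucket_width=10):
    """Map each bucket's starting age to the percentage of ages that fall in it."""
    counts = {}
    for age in ages:
        start = (age // bucket_width) * bucket_width
        counts[start] = counts.get(start, 0) + 1
    total = len(ages)
    return {start: 100 * counts[start] / total for start in sorted(counts)}


# Invented ages; the two groups have different sizes on purpose.
group_one = [22, 25, 31, 38, 45]
group_two = [19, 23, 27, 29, 34, 36, 41, 52, 58, 63]

for name, ages in (("group one", group_one), ("group two", group_two)):
    print(name)
    for start, percent in percent_histogram(ages).items():
        print(f"  {start}-{start + 9}: {percent:.0f}%")
```

With raw counts, the larger group would look taller in every bucket. With percentages, the shapes can be compared directly.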

The task for Wednesday’s class is to come up with a question and email it. My question: is there a correlation between race and the “armed with” category? From this question, there were other correlations that I got curious about. Among the columns threat_type, flee_status, and armed_with, is there a correlation between all three, or even between two of the three, or any trend to notice?
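Since these columns hold categories rather than numbers, one common first step (an assumption on my part, not something decided in class) is a two-way table that counts how often each pair of values occurs together. A minimal sketch for flee_status against armed_with follows. The pairs are invented placeholder values, and `cross_tab` and `row_shares` are hypothetical helper names.

```python
from collections import Counter

# Invented (flee_status, armed_with) pairs, one per record.
pairs = [
    ("not", "gun"),
    ("not", "knife"),
    ("car", "gun"),
    ("not", "gun"),
    ("foot", "unarmed"),
    ("car", "gun"),
]


def cross_tab(pairs):
    """Count how often each column value appears for each row value."""
    table = {}
    for row_value, col_value in pairs:
        table.setdefault(row_value, Counter())[col_value] += 1
    return table


def row_shares(table):
    """Convert each row of counts into shares that sum to 1."""
    shares = {}
    for row_value, counts in table.items():
        row_total = sum(counts.values())
        shares[row_value] = {col: n / row_total for col, n in counts.items()}
    return shares


table = cross_tab(pairs)
for row_value, share in row_shares(table).items():
    print(row_value, {col: round(s, 2) for col, s in share.items()})
```

If the rows show very different shares, that hints at a relationship worth testing more formally later.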

This Friday, we discussed the difference between states and counties. Not every county has had a shooting in it, as there are plenty of counties per state and their areas and population sizes vary. I enjoyed seeing Nicole’s graph and hearing she wants to focus more on the counties that had the most shootings.

Step 10!

Create a new post. This might count for the weekly assignment of writing a paragraph. Maybe it won’t and I am just rambling. But I don’t particularly mind. I really appreciate the no tests or quizzes aspect of this course. I also think I will really find interest in diving into our first topic. While police brutality and misconduct are not light or fun topics, I already have a few statistics questions that I would like to dive into when the time comes!

Also, Dr. Davis, it was really nice to meet you and I look forward to working in your course 🙂