{"id":19,"date":"2022-11-10T09:36:39","date_gmt":"2022-11-10T09:36:39","guid":{"rendered":"https:\/\/dcarr-projects.com\/?p=19"},"modified":"2022-11-10T09:36:40","modified_gmt":"2022-11-10T09:36:40","slug":"the-prepare-phase","status":"publish","type":"post","link":"https:\/\/dcarr-projects.com\/index.php\/2022\/11\/10\/the-prepare-phase\/","title":{"rendered":"The &#8220;Prepare&#8221; Phase"},"content":{"rendered":"\n<p>The next phase of the analysis is to prepare the data that will be used. Some important things to determine when conducting such analysis are the credibility of the data and if it is used under proper license. <\/p>\n\n\n\n<p>In reality, this is publicly available data collected by Motivate International, which Coursera compiled for the purposes of this project. For the sake of this scenario, we will say that this is raw data which was provided to us by Cyclistic&#8217;s IT Department.&nbsp;<\/p>\n\n\n\n<p><strong>The Goal: <\/strong>Determine how casual riders use Cyclistic bicycle services differently than riders with an annual membership.<\/p>\n\n\n\n<p><strong>The Data: <\/strong>We used data collected by the company on each ride that was taken for the year of 2021. This raw data was stored on a web page built specifically to house it. Since this data is all in-house, it can be considered a primary source and is credible to use. This data was compiled into spreadsheets for each month of the year, with titles like \u201c202104-divvy-tripdata.zip\u201d assigned to them. I unzipped the files and placed them into a folder named \u201c2021 Raw Trip Data\u201d, located within a main folder on my desktop named \u201cMarketing Analysis\u201d. Next, I renamed all the files using the format \u201c2021-04-raw-tripdata.csv\u201d to make the dates more legible. We will keep all of our raw spreadsheets in this folder, and place an edited version of each sheet into a folder we will call \u201c2021 Edited Trip Data.\u201d The edited files will all be converted into CSV files that we will use with Excel, in order to avoid conflicts down the road when uploading datasets from the R program. <\/p>\n\n\n\n<p><em>*Note: I encountered problems working with Excel sheets using R programing in a later part of the project, so we will instead do our work using CSV format on our files.&nbsp;The Excel Workbook files will still be kept in the project folder, in case someone else would like to work with them.<\/em><\/p>\n\n\n\n<p><\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" width=\"793\" height=\"426\" src=\"https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-raw-files.png\" alt=\"\" class=\"wp-image-137\" srcset=\"https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-raw-files.png 793w, https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-raw-files-300x161.png 300w, https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-raw-files-768x413.png 768w\" sizes=\"(max-width: 793px) 100vw, 793px\" \/><\/figure>\n\n\n\n<p>Opening up the file for January, we can see the different columns that this data is divided up into. The \u201cride_id\u201d column lists the unique id assigned to each ride. The \u201crideable_type\u201d column shows what kind of bicycle they used.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" width=\"471\" height=\"396\" src=\"https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-ride_id-and-rideable_type.png\" alt=\"\" class=\"wp-image-138\" srcset=\"https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-ride_id-and-rideable_type.png 471w, https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-ride_id-and-rideable_type-300x252.png 300w\" sizes=\"(max-width: 471px) 100vw, 471px\" \/><\/figure><\/div>\n\n\n<p>There are columns for the time that each ride began and when it ended, as well as locational data regarding where the ride began and where the bike was dropped off. This locational data consists of the name of the stations where the bikes were picked up and dropped off, as well as the stations\u2019 id and longitude and latitude coordinates. <\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" width=\"951\" height=\"348\" src=\"https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-coordiantes-and-names-showcase.png\" alt=\"\" class=\"wp-image-139\" srcset=\"https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-coordiantes-and-names-showcase.png 951w, https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-coordiantes-and-names-showcase-300x110.png 300w, https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-coordiantes-and-names-showcase-768x281.png 768w\" sizes=\"(max-width: 951px) 100vw, 951px\" \/><\/figure>\n\n\n\n<p>The last column offers data crucial to our analysis, with \u201cmember_causal\u201d showing whether the rider was a casual customer or an annual member.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img loading=\"lazy\" width=\"521\" height=\"216\" src=\"https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-member_casual.png\" alt=\"\" class=\"wp-image-140\" srcset=\"https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-member_casual.png 521w, https:\/\/dcarr-projects.com\/wp-content\/uploads\/2022\/09\/Excel-member_casual-300x124.png 300w\" sizes=\"(max-width: 521px) 100vw, 521px\" \/><\/figure><\/div>\n\n\n<p>All this data put together offers enough material from which to build an analysis of the&nbsp; different ways these customers use our bikes.&nbsp;<\/p>\n\n\n\n<div class=\"wp-container-1 is-content-justification-center wp-block-buttons\">\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link\" href=\"https:\/\/dcarr-projects.com\/?p=15\">Previous Page: Ask Phase<\/a><\/div>\n\n\n\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link\" href=\"https:\/\/dcarr-projects.com\/?page_id=11\">Main Page<\/a><\/div>\n\n\n\n<div class=\"wp-block-button\"><a class=\"wp-block-button__link\" href=\"https:\/\/dcarr-projects.com\/?p=22\">Next Page: Cleaning<\/a><\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>The next phase of the analysis is to prepare the data that will be used. Some important things to determine when conducting such analysis are the credibility of the data and if it is used under proper license. In reality, this is publicly available data collected by Motivate International, which Coursera compiled for the purposes&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"om_disable_all_campaigns":false,"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false},"categories":[3],"tags":[],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/posts\/19"}],"collection":[{"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/comments?post=19"}],"version-history":[{"count":8,"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/posts\/19\/revisions"}],"predecessor-version":[{"id":338,"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/posts\/19\/revisions\/338"}],"wp:attachment":[{"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/media?parent=19"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/categories?post=19"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dcarr-projects.com\/index.php\/wp-json\/wp\/v2\/tags?post=19"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}