{"id":104,"date":"2020-08-18T01:27:23","date_gmt":"2020-08-18T01:27:23","guid":{"rendered":"http:\/\/localhost\/dwbapps\/?page_id=104"},"modified":"2021-01-15T05:23:01","modified_gmt":"2021-01-15T05:23:01","slug":"screenshots","status":"publish","type":"page","link":"https:\/\/thedatapup.com\/home\/index.php\/data-pup\/screenshots\/","title":{"rendered":"Data Analysis with Data Pup"},"content":{"rendered":"\n<p>The following screens demonstrate the exploratory data analysis features of Datapup when using it in Python workspace mode.<\/p>\n\n\n\n<p style=\"font-size:12px\">NOTE: The following guide uses data from data.gov.au and is licensed under <a href=\"http:\/\/www.opendefinition.org\/licenses\/cc-by\" data-type=\"URL\" data-id=\"http:\/\/www.opendefinition.org\/licenses\/cc-by\">Creative Commons Attribution<\/a>:<br>&#8211; Airport traffic data <a href=\"https:\/\/data.gov.au\/dataset\/ds-dga-cc5d888f-5850-47f3-815d-08289b22f5a8\">https:\/\/data.gov.au\/dataset\/ds-dga-cc5d888f-5850-47f3-815d-08289b22f5a8<\/a><br>&#8211; NSW COVID-19 cases by age range <a href=\"https:\/\/data.gov.au\/dataset\/ds-nsw-3dc5dc39-40b4-4ee9-8ec6-2d862a916dcf\/details\">https:\/\/data.gov.au\/dataset\/ds-nsw-3dc5dc39-40b4-4ee9-8ec6-2d862a916dcf<\/a><\/p>\n\n\n\n<p style=\"font-size:10px\"><\/p>\n\n\n\n<p>First, create a python workspace, here called &#8220;example&#8221;.  Specify a file where you will put your Python script code.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"798\" height=\"496\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss1.png\" alt=\"\" class=\"wp-image-386\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss1.png 798w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss1-300x186.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss1-768x477.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<p>Click on DataFrames &gt; Python Script, or click on the code button shown below to access the Python editor window:<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"877\" height=\"555\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss2.png\" alt=\"\" class=\"wp-image-387\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss2.png 877w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss2-300x190.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss2-768x486.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<p>This window allows you to enter Python code that creates Pandas DataFrames.  Any variable that is a dataframe will be available for querying.  Here I load a csv file into the variable df_pax.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"878\" height=\"651\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss3.png\" alt=\"\" class=\"wp-image-388\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss3.png 878w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss3-300x222.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss3-768x569.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<p>DataFrames appear on the right.&nbsp; Right click on one, and you can do standard EDA operations like correlations, data types and describe from the Python Pandas library.&nbsp;<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"891\" height=\"505\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss5.png\" alt=\"\" class=\"wp-image-390\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss5.png 891w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss5-300x170.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss5-768x435.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"875\" height=\"549\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss6.png\" alt=\"\" class=\"wp-image-391\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss6.png 875w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss6-300x188.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss6-768x482.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<p>Next, write your SQL directly in the SQL window, and execute using the Run button, or Ctrl-Enter or Shift-F9.&nbsp; (Press Ctrl-H to see all shortcut keys).<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"882\" height=\"544\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss8.png\" alt=\"\" class=\"wp-image-393\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss8.png 882w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss8-300x185.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss8-768x474.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<p><br>Right click on the results, or use the buttons on the right, to run commands such as correlation and describe on the output table. <\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"888\" height=\"531\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss9.png\" alt=\"\" class=\"wp-image-378\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss9.png 888w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss9-300x179.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/ss9-768x459.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<h2><em>Plots and counts<\/em><\/h2>\n\n\n\n<p>Right click on a column header in the results, to run commands such as box plots and scatter plots.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"876\" height=\"538\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/pcss1.png\" alt=\"\" class=\"wp-image-384\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/pcss1.png 876w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/pcss1-300x184.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/pcss1-768x472.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<p>Here I&#8217;ve chosen Int_pax_in for the Y axis value.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"915\" height=\"641\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/pcss2.png\" alt=\"\" class=\"wp-image-385\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/pcss2.png 915w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/pcss2-300x210.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/pcss2-768x538.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<h2>Importing CSV using the GUI<\/h2>\n\n\n\n<p>Back to the main window, from the menu DataFrames &gt; Import CSV, you can import CSV files without writing Python.  Here I&#8217;ve given the DataFrame a name of covidnsw, and accepted defaults.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"878\" height=\"547\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss1.png\" alt=\"\" class=\"wp-image-394\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss1.png 878w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss1-300x187.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss1-768x478.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<p>Data Pup has automatically updated the python script (since I left the update python script option checked)<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"887\" height=\"644\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss2.png\" alt=\"\" class=\"wp-image-379\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss2.png 887w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss2-300x218.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss2-768x558.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<p>Back in the main window, add some additional SQL to query the new DataFrame covidnsw, and execute it.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"884\" height=\"553\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss4.png\" alt=\"\" class=\"wp-image-381\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss4.png 884w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss4-300x188.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss4-768x480.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<p>Right click on column age_group and choose value counts.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"877\" height=\"545\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss5.png\" alt=\"\" class=\"wp-image-382\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss5.png 877w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss5-300x186.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss5-768x477.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"871\" height=\"536\" src=\"http:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss6.png\" alt=\"\" class=\"wp-image-383\" srcset=\"https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss6.png 871w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss6-300x185.png 300w, https:\/\/thedatapup.com\/home\/wp-content\/uploads\/2020\/10\/impss6-768x473.png 768w\" sizes=\"(max-width: 767px) 89vw, (max-width: 1000px) 54vw, (max-width: 1071px) 543px, 580px\" \/><\/figure>\n","protected":false},"excerpt":{"rendered":"<p>The following screens demonstrate the exploratory data analysis features of Datapup when using it in Python workspace mode. NOTE: The following guide uses data from data.gov.au and is licensed under Creative Commons Attribution:&#8211; Airport traffic data https:\/\/data.gov.au\/dataset\/ds-dga-cc5d888f-5850-47f3-815d-08289b22f5a8&#8211; NSW COVID-19 cases by age range https:\/\/data.gov.au\/dataset\/ds-nsw-3dc5dc39-40b4-4ee9-8ec6-2d862a916dcf First, create a python workspace, here called &#8220;example&#8221;. Specify a file &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/thedatapup.com\/home\/index.php\/data-pup\/screenshots\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Data Analysis with Data Pup&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"parent":52,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":[],"_links":{"self":[{"href":"https:\/\/thedatapup.com\/home\/index.php\/wp-json\/wp\/v2\/pages\/104"}],"collection":[{"href":"https:\/\/thedatapup.com\/home\/index.php\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/thedatapup.com\/home\/index.php\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/thedatapup.com\/home\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/thedatapup.com\/home\/index.php\/wp-json\/wp\/v2\/comments?post=104"}],"version-history":[{"count":5,"href":"https:\/\/thedatapup.com\/home\/index.php\/wp-json\/wp\/v2\/pages\/104\/revisions"}],"predecessor-version":[{"id":518,"href":"https:\/\/thedatapup.com\/home\/index.php\/wp-json\/wp\/v2\/pages\/104\/revisions\/518"}],"up":[{"embeddable":true,"href":"https:\/\/thedatapup.com\/home\/index.php\/wp-json\/wp\/v2\/pages\/52"}],"wp:attachment":[{"href":"https:\/\/thedatapup.com\/home\/index.php\/wp-json\/wp\/v2\/media?parent=104"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}