You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
-[Speech and Language Processing, 3rd Edition](https://web.stanford.edu/~jurafsky/slp3/) Working version of Jurafsky, et. al. book on natural language processing whose content on n-grams is helpful for the capstone.
9
+
10
+
## Course Project
11
+
12
+
-[n-gram Computations and Computer Capacity](http://bit.ly/2couvxh) Explains the amount of memory required to convert the text files for the course project into n-grams, using the <strong>quanteda</strong> package.
13
+
-[Capstone Strategy](http://bit.ly/2rGcgc6) Describes a general strategy to get through the Capstone: use the simplest approaches possible.
14
+
-[Choosing a Text Analysis Package](http://bit.ly/2qagsPa) Reviews pros and cons of various R packages used for natural language processing, in the context of requirements for the Capstone project.
Copy file name to clipboardExpand all lines: getclean.md
+4Lines changed: 4 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -16,6 +16,10 @@ permalink: /getclean/
16
16
-[Codebook template that can be used in the Getting and Cleaning Data project](https://gist.github.com/JorisSchut/dbc1fc0402f28cad9b41)
17
17
-["Real world" example - reading American Community Survey 2000 PUMS Data:](https://github.com/lgreski/acsexample) Demonstrates how to extract records of a given type from a data file containing multiple record types, and how to use an Excel-based code book to specify arguments for reading a fixed-width file.
18
18
-[18 Months of CTA advice](https://thoughtfulbloke.wordpress.com/2015/08/31/hello-world)
19
+
-[Common Problems: Quiz 1 - Missing Java Runtime](http://bit.ly/2jjtyXM) Explains how to solve the problem of a missing Java Runtime for the question that requires students to process a Microsoft Excel spreadsheet.
20
+
-[Strategy for Reading Files & APIs / Quiz 2](http://bit.ly/2e4L5oF)
21
+
-[Common Problems: Quiz 2 - sqldf() driver fails to connect](http://bit.ly/2kD2KTY)
22
+
-[Tutorial: Downloading Files](http://bit.ly/2iP2suj) Illustrates various ways of downloading files, including binary and text files.
Copy file name to clipboardExpand all lines: rprog.md
+6-1Lines changed: 6 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -15,6 +15,7 @@ layout: page
15
15
-[Strategy for Coding the Programming Assignments](http://bit.ly/2ddFh9A)
16
16
-[Tutorial for those struggling with Programming Assignment 1](https://github.com/derekfranks/practice_assignment)
17
17
-[Breaking Down pollutantmean](http://bit.ly/2cHyiCl)
18
+
-[Assignment 1: A More Elegant Solution](http://bit.ly/2kwBBlK)
18
19
-[A SAS Version of pollutantmean?](http://bit.ly/2d3DR4e)
19
20
-[Tutorial for those struggling with Programming Assignment 2](https://github.com/DanieleP/PA2-clarifying_instructions)
20
21
-[Tutorial for those struggling with Programming Assignment 3](https://github.com/DanieleP/PA3-tutorial)
@@ -23,7 +24,7 @@ layout: page
23
24
-[Alternative submit script for Programming Assignment 1 that makes submitting more convenient by allowing selection of multiple parts plus prompting if user wants to submit another part before exiting](https://github.com/rchampoux/coursera/blob/master/rprog-scripts-submitscript1.R)
24
25
-[Grading the SHA-1 Hash Code](http://bit.ly/2iUWoB6)
-[Assignment 2: makeCacheMatrix as an Object](http://bit.ly/2byUe4e)
27
+
-[Assignment 2: makeCacheMatrix as an Object](http://bit.ly/2byUe4e)
27
28
28
29
29
30
## R Language
@@ -33,7 +34,11 @@ layout: page
33
34
-[S Objects, R Objects, and Lexical Scoping](http://bit.ly/2dtOSXi)
34
35
-[Common R Mistakes: Overwriting Functions with Data Objects](http://bit.ly/2i3gmoA)
35
36
-[Forms of the Extract Operator](http://bit.ly/2bzLYTL)
37
+
-[Functions to Sort Data Frames](http://bit.ly/2dxItzw)
36
38
-[Creative Use of R: Downloading Course Lectures](http://bit.ly/2bGlI7R) Article illustrating how to use R to automate the download of lectures from *Data Science Specialization* courses, such as *R Programming*. Techniques used in this article are helpful to make research reproducible, as required for courses like *Getting and Cleaning Data* and *Reproducible Research*.
39
+
-[Lexical Scoping and Statistical Computing](http://bit.ly/2cmqAPy) Article by Robert Gentleman and Ross Ihaka at the University of Auckland describing how lexical scoping works, and why it is valuable in statistical computing.
40
+
-[Data Science Job Report 2017: R Passes SAS, But Python Leaves Them Both Behind](http://bit.ly/2oCHulX) Bob Muenchen's take on the job market for various data science langauges.
0 commit comments