-
Notifications
You must be signed in to change notification settings - Fork 3
/
Copy pathch-032-learning_r.html
429 lines (377 loc) · 19.6 KB
/
ch-032-learning_r.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta charset="utf-8" />
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta name="generator" content="pandoc" />
<title>ch-032-learning_r.utf8.md</title>
<script src="ch-032-learning_r_files/jquery-1.11.3/jquery.min.js"></script>
<meta name="viewport" content="width=device-width, initial-scale=1" />
<link href="ch-032-learning_r_files/bootstrap-3.3.5/css/readable.min.css" rel="stylesheet" />
<script src="ch-032-learning_r_files/bootstrap-3.3.5/js/bootstrap.min.js"></script>
<script src="ch-032-learning_r_files/bootstrap-3.3.5/shim/html5shiv.min.js"></script>
<script src="ch-032-learning_r_files/bootstrap-3.3.5/shim/respond.min.js"></script>
<script src="ch-032-learning_r_files/navigation-1.1/tabsets.js"></script>
<style type="text/css">
h1 {
font-size: 34px;
}
h1.title {
font-size: 38px;
}
h2 {
font-size: 30px;
}
h3 {
font-size: 24px;
}
h4 {
font-size: 18px;
}
h5 {
font-size: 16px;
}
h6 {
font-size: 12px;
}
.table th:not([align]) {
text-align: left;
}
</style>
<link rel="stylesheet" href="textbook.css" type="text/css" />
</head>
<body>
<style type = "text/css">
.main-container {
max-width: 940px;
margin-left: auto;
margin-right: auto;
}
code {
color: inherit;
background-color: rgba(0, 0, 0, 0.04);
}
img {
max-width:100%;
height: auto;
}
.tabbed-pane {
padding-top: 12px;
}
.html-widget {
margin-bottom: 20px;
}
button.code-folding-btn:focus {
outline: none;
}
summary {
display: list-item;
}
</style>
<div class="container-fluid main-container">
<!-- tabsets -->
<style type="text/css">
.tabset-dropdown > .nav-tabs {
display: inline-table;
max-height: 500px;
min-height: 44px;
overflow-y: auto;
background: white;
border: 1px solid #ddd;
border-radius: 4px;
}
.tabset-dropdown > .nav-tabs > li.active:before {
content: "";
font-family: 'Glyphicons Halflings';
display: inline-block;
padding: 10px;
border-right: 1px solid #ddd;
}
.tabset-dropdown > .nav-tabs.nav-tabs-open > li.active:before {
content: "";
border: none;
}
.tabset-dropdown > .nav-tabs.nav-tabs-open:before {
content: "";
font-family: 'Glyphicons Halflings';
display: inline-block;
padding: 10px;
border-right: 1px solid #ddd;
}
.tabset-dropdown > .nav-tabs > li.active {
display: block;
}
.tabset-dropdown > .nav-tabs > li > a,
.tabset-dropdown > .nav-tabs > li > a:focus,
.tabset-dropdown > .nav-tabs > li > a:hover {
border: none;
display: inline-block;
border-radius: 4px;
}
.tabset-dropdown > .nav-tabs.nav-tabs-open > li {
display: block;
float: none;
}
.tabset-dropdown > .nav-tabs > li {
display: none;
}
</style>
<script>
$(document).ready(function () {
window.buildTabsets("TOC");
});
$(document).ready(function () {
$('.tabset-dropdown > .nav-tabs > li').click(function () {
$(this).parent().toggleClass('nav-tabs-open')
});
});
</script>
<!-- code folding -->
<div class="fluid-row" id="header">
</div>
<div id="i-am-always-doing-that-which-i-cannot-do-in-order-that-i-may-learn-how-to-do-it." class="section level2">
<h2>“I am always doing that which I cannot do, in order that I may learn how to do it.”</h2>
<blockquote>
<p>Pablo Picasso</p>
</blockquote>
<p><img src="http://www.galleriesnow.net/wp-content/uploads/2019/05/Picasso.jpg" /></p>
<center>
<h2>
Nailed it
</h2>
</center>
</div>
<div id="learning-how-to-learn" class="section level1">
<h1>LEARNING HOW TO LEARN</h1>
<div class="tip">
<h2 id="key-concepts">Key Concepts</h2>
<ul>
<li>As you begin learning a new programming language you need to master nouns and verbs.</li>
<li>Nouns represent the jargon specific to the field (objects, arguments, assignment, etc.).</li>
<li>Verbs represent functions.</li>
<li>Fluency takes time! Find ways to practice “speaking R” productively.</li>
</ul>
<p><strong>Learning to Learn</strong></p>
<ul>
<li>R has thousands of packages and new ones are created daily. There is no way to learn everything about the language.</li>
<li>The goal, rather, is figuring out how to quickly master new functions so you can leverage R for whatever problem you are working on.</li>
<li>This requires learning to read help files and adapt code, and learning how to get help when you are stuck.</li>
</ul>
<p><br> <br></p>
</div>
<p><br></p>
<div id="learning-how-to-learn-1" class="section level2">
<h2>Learning How to Learn</h2>
<hr />
<p>Let’s begin this chapter by stating and unpacking a few assertions:</p>
<ol style="list-style-type: decimal">
<li>Learning R is very similar to learning a spoken language.</li>
<li>Fluency in a programming language is different than fluency in a spoken language.</li>
<li>The ultimate end to learning R is to <em>learn how to learn</em> so you can quickly master whatever packages are necessary for your current project.</li>
</ol>
<hr />
<p><br></p>
<div id="similarities-between-human-machine-languages" class="section level3">
<h3>Similarities Between Human & Machine Languages</h3>
<p>The similarities between human languages (e.g. English, Arabic, Mandarin) and machine languages (e.g. R, Python, JavaScript) are astounding, but a nuanced discussion as to why is beyond the scope of this introduction.</p>
<p>As examples, however, both human and machine languages:</p>
<ul>
<li>Persist or disappear depending on the size of communities that use them</li>
<li>Derive from older languages and have evolved into new languages</li>
<li>Differ semantically but are arranged with similar grammatical elements (e.g. nouns, verbs)</li>
</ul>
<p><br></p>
<p>Why is this important? <em>You learn R the same way you would a foreign language</em>:</p>
<ul>
<li><strong>Vocabulary:</strong> You need to learn words for things before all else.</li>
<li><strong>Reinforcement:</strong> You learn by doing and repeating.</li>
<li><strong>Immersion:</strong> You can’t be fluent without being immersed. Get involved in professional and volunteer work, podcasts, and regular reading about your topic.</li>
<li><strong>Necessity:</strong> You learn best when success or survival depends on it. Use R in projects that have deadlines as a commitment mechanism.<br />
</li>
<li><strong>Languages are Social:</strong> You learn faster through collaboration, meetups, forums, conferences.</li>
</ul>
<p>If this is your first programming language, you will get frustrated at times. Take a step back and remember that after a semester of Spanish you can only operate at the level of a three-year old. You only know one verb tense, a few dozen verbs, and several hundred words. You have so many emotions that you can’t express in your new language!</p>
<p>Similarly, you are going to have so many ideas that you want to to translate into code! More realistically you will end up melting down in office hours and then taking a nap.</p>
<p><br></p>
<div class="note">
<p><strong>Fun Fact:</strong> The popular <a href="https://ggplot2.tidyverse.org/">ggplot2</a> data viz package is based on Leland Wilkinson’s framework, “The Grammar of Graphics” (1999), i.e. the <code>gg</code> in <code>ggplot2</code>. Similarly, the <a href="https://dplyr.tidyverse.org/">dplyr</a> package is described as a “grammar of data manipulation”, in which functions are described as “verbs”.</p>
</div>
<p><br></p>
</div>
<div id="you-cant-learn-everything-in-r-nor-should-you" class="section level3">
<h3>You Can’t Learn Everything in R (Nor Should You)</h3>
<p>There are thousands of R packages. The body of knowledge in the field is growing exponentially. It’s impossible (and unnecessary) to learn (or retain) everything the language offers.</p>
<p>Data science is a fast-moving field of practice.</p>
<ul>
<li>New R packages are released every day.</li>
<li>Packages and functions are constantly being updated, replaced, consolidated, or deprecated.</li>
<li>There are nuances to every computational science applied to each natural or social science.</li>
</ul>
<p>In sum, to remain current in R, one must remain current with… everything.</p>
<p>Accepting these limitations is liberating. It allows one to focus on an <em>ab initio</em>, or “first principles” approach to learning R. That is, build solid foundations, then specialize as necessary.</p>
<p><br></p>
</div>
<div id="the-ultimate-objective-learning-how-to-learn-r" class="section level3">
<h3>The Ultimate Objective: Learning How to Learn R</h3>
<p>There are few telltale signs that you’re becoming fluent in R. For example, you may:</p>
<ul>
<li>Achieve the same results as your old code with significantly less code</li>
<li>Use search engines less frequently or not at all while coding</li>
<li>Be approached by others for collaborative work</li>
<li>Appraise the quality of code and data products</li>
<li>Begin dreaming in R (just kidding, not kidding…)</li>
</ul>
<p><br></p>
<p>The goal of “learning R” is just <em>learning how to learn R</em>. This means knowing when to step on the gas and try to power through a problem, and knowing when to hit the brake in order to take a step back, assess, and make a plan. Knowing when and how to ask for help is a basic level of situational awareness you need to develop.</p>
<p>For example, imagine you’ve reached some impasse that the “first principles” approach to R fails to resolve. Do you:</p>
<ul>
<li>Pore over online forums?</li>
<li>Explore documentation and vignettes?</li>
<li>Search for packages with potential solutions?</li>
<li>Boldly install and learn said package(s) without hesitation?</li>
<li>Recover when fail to resolve the issue? Fail and recover again?</li>
<li>Remain firm in your conviction that you will resolve the issue?</li>
</ul>
<p><br></p>
<p><img src="https://imgs.xkcd.com/comics/wisdom_of_the_ancients.png" width="60%" style="display: block; margin: auto;" /></p>
<center>
<em>The struggle is real. Source: <a href="https://xkcd.com/979/">XKCD</a></em>
</center>
<p><br></p>
<p>This is only one of many hypothetical scenarios. Learning how to learn R is not a matter of learning something new everyday (e.g. a function, package, technique, or theory), but the cyclical process of learning in the face of necessity, shoring up your knowledge over some indefinite period, and learning again when the need arises. Above all, you’ll know you’ve learned how to learn R when you no longer doubt your ability to resolve what once seemed impossible.</p>
<p><br></p>
<div class="tip">
<h2 id="you-are-not-alone-in-your-quest">You are not alone in your quest</h2>
<p><img src="https://media1.tenor.com/images/7b68a2d2ce39aeb58b700a212d736655/tenor.gif?itemid=7237007" /></p>
<p>When working on labs make sure you have a game plan for success. If you are stuck for more than 20 minutes on the same problem <em>without making progress</em> you should (1) phone a friend, (2) post to the <strong>Get Help</strong> discussion board on the course site, (3) attend a review session, or (4) schedule virtual office hours.</p>
<p>Your morale is a limited commodity, so spend it wisely! If you are doing the wrong thing over and over while stuck you might actually be reinforcing bad habits.</p>
<p>The qualifier <em>without making progress</em> is a relative term. You will spend more than 20 minutes on a problem without arriving at a solution. Stuck means you don’t understand the question or you are out of things to try.</p>
<p>Technically, for 3-credit class you should be spending about 12 hours per week on the class (3 hours of class time, 9 hours on homework/projects). If it’s a condensed semester (7.5 week schedule) that translates to 24 hours per week.</p>
<p>This will vary for everyone depending on background knowledge, speed, and interest! But it’s a good rule of thumb to keep in mind, and a good reminder that you are not slow or incapable - there is a reason that they call education “investment in human capital”!</p>
<p><br> <br></p>
</div>
<p><br> <br></p>
</div>
</div>
<div id="troubleshooting-in-r" class="section level2">
<h2>Troubleshooting in R</h2>
<p>Your code is going to fail. A lot.</p>
<p><img src="https://media3.giphy.com/media/ZeB4HcMpsyDo4/source.gif" /></p>
<p>We encounter errors all the time. The following is a brief overview for recognizing a problem, as well as myriad strategies to overcome one.</p>
<p><br></p>
<div id="recognizing-understanding-trouble" class="section level3">
<h3>Recognizing & Understanding Trouble</h3>
<blockquote>
<p>“The first step in fixing a broken program is getting it to fail repeatably.” (Tom Duff)</p>
</blockquote>
<p><br></p>
<p>Two simple criteria must be met in order to recognize trouble with your code (<a href="https://bookdown.org/rdpeng/RProgDA/">Peng et al. 2017</a>):</p>
<ol style="list-style-type: decimal">
<li>You expect your code to behave in a certain way</li>
<li>Your code doesn’t behave as expected</li>
</ol>
<p>Oftentimes, the best case scenario is that R will throw an <strong>error message</strong>. Error messages signal that R has encountered some fatal flaw in your code and, in consequence, nothing happens.</p>
<p>The next best scenario is that R will throw a <strong>warning message</strong>. Warning messages signal that R has successfully evaluated and executed code, but something unexpected happened, and R wants you to know about it.</p>
<p>The worst and most pernicious scenario is when your code runs perfectly fine, without error or warning messages, yet the output isn’t exactly what you’d wanted. If you recognize this, that’s great. If you don’t recognize this, errors flow downstream and can be difficult to detect and fix. <em>Always inspect your output</em>.</p>
<p><br></p>
<div class="tip">
<p><strong>Pro Tip:</strong> Oftentimes, users place too much emphasis on fixing their expected results and fail to consider their original expectations. <em>Never neglect revisiting your expectations</em>, as it’s possible that they, not your code, may explain an error (ibid.).</p>
</div>
<p><br></p>
</div>
</div>
</div>
<div id="the-learning-curve" class="section level1">
<h1>The Learning Curve</h1>
<p>Are you on track? How should you measure progress?</p>
<p>Again, the language metaphor comes in handy.</p>
<div id="vocab" class="section level3">
<h3>Vocab</h3>
<p>We will emphasize vocabulary the first course. The concepts in programming are not hard, but they require a lot of jargon.</p>
<p>Most help files were written by computer scientists or statisticians. In other words, weirdos. Don’t be surprised if it takes time to, um, decode their syntax.</p>
</div>
<div id="adapting-code" class="section level3">
<h3>Adapting Code</h3>
<p>One of the first skills you learn while programming is how to start from existing code and adapting it for a new purpose. It is way more common to do this than to write code from scratch.</p>
<p>An important milestone is reaching a point where you can read code written by others and understand it without having to run it.</p>
</div>
<div id="critical-thinking" class="section level3">
<h3>Critical Thinking</h3>
<p>You might think that the most valuable skill you take from this course is knowledge of R. This is not correct.</p>
<p><strong>The real value in learning to program is learning how to think critically with data and problem-solve.</strong></p>
<p>Programming languages are rigid and unforgiving.</p>
<blockquote>
<p>Did you use a capital letter when I was expecting a lower-case one? I’m sorry, I can’t run your program then.</p>
</blockquote>
<p>Although frustrating, it does teach your precision and discipline. You need to think very systematically about the problem in order to design something that works.</p>
<p>You will learn to value <strong>pseudo-code</strong>, which is a way of sketching out a basic blue-print for a script in plain English before you write any code. Then you can work incrementally and translate each step into R, refining as you go.</p>
</div>
<div id="writing-your-own-code" class="section level3">
<h3>Writing Your Own Code</h3>
<p>Drafting code from scratch is the equivalent of fluency in programming. This will come in time, but do not expect to jump right into creating elegant scripts. You need to practice the skills above, develop your vocabulary, master some data verbs (functions) before it all comes together. But you are on your way!</p>
</div>
<div id="a-preview-of-whats-ahead" class="section level3">
<h3>A Preview of What’s Ahead</h3>
<p><img src="figures/learning-curve-01.png" width="90%" style="display: block; margin: auto;" /></p>
<p><br></p>
<p><img src="figures/learning-curve-02.png" width="90%" style="display: block; margin: auto;" /></p>
<p><br></p>
<p><img src="figures/learning-curve-03.png" width="90%" style="display: block; margin: auto;" /></p>
<p><br></p>
</div>
<div id="further-resources" class="section level2">
<h2>Further Resources</h2>
<p>The following resources may prove valuable for both learning how to learn R and troubleshooting:</p>
<p><strong>I) Full-Length Text</strong></p>
<ul>
<li><a href="https://rpubs.com/jamisoncrawford/nutsandbolts">“Intro to R: Nuts & Bolts”</a> (Crawford, 2018)</li>
</ul>
<p><strong>II) Documentation & Vignettes</strong></p>
<ul>
<li><a href="http://r-pkgs.had.co.nz/vignettes.html">“Vignettes: Long-Form Documentation”</a> (Wickham, 2019)</li>
</ul>
<p><strong>III) Forums, Email Lists, & Communities</strong></p>
<ul>
<li><a href="https://stackoverflow.com/">Stack Overflow</a></li>
<li><a href="https://stats.stackexchange.com/">Cross Validated</a></li>
<li><a href="https://rpubs.com/jamisoncrawford/nutsandbolts">R Mailing Lists</a></li>
<li><a href="https://www.reddit.com/r/rstats/">Reddit’s r/Rstats</a></li>
<li><a href="https://twitter.com/hashtag/rstats?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Ehashtag">Twitter’s #Rstats</a></li>
</ul>
<p><strong>IV) Asking R-Related Questions</strong></p>
<ul>
<li><a href="http://www.catb.org/esr/faqs/smart-questions.html">“Hot to Ask Questions the Smart Way”</a> (Raymond, 2014)</li>
<li><a href="https://stackoverflow.com/questions/5963269/how-to-make-a-great-r-reproducible-example">“How to Make a Great R Reproducible Example”</a> (Stack Overflow, 2018)</li>
<li><a href="https://www.r-project.org/posting-guide.html">“Posting Guide: Hot to Ask Good Questions that Prompt Useful Answers”</a> (Plate, 2013)</li>
<li><a href="http://rex-analytics.com/decoding-error-messages-r/">Errors vs Warnings</a> (Rex Analytics)</li>
</ul>
<p><strong>V) ASCII Dragons</strong></p>
<ul>
<li><a href="https://danadaldos.wordpress.com/2015/02/06/padded-with-%E2%88%8F/">danadaldoss</a></li>
<li><a href="https://www.dougsartgallery.com/ascii-art-dragon.html">Doug</a></li>
<li><a href="https://www.deviantart.com/ahmadsaifulhakim/art/Dragon-ascii-art-525981480">Ahmad</a></li>
</ul>
<p><br> <br></p>
</div>
</div>
</div>
<script>
// add bootstrap table styles to pandoc tables
function bootstrapStylePandocTables() {
$('tr.header').parent('thead').parent('table').addClass('table table-condensed');
}
$(document).ready(function () {
bootstrapStylePandocTables();
});
</script>
<!-- dynamically load mathjax for compatibility with self-contained -->
<script>
(function () {
var script = document.createElement("script");
script.type = "text/javascript";
script.src = "https://mathjax.rstudio.com/latest/MathJax.js?config=TeX-AMS-MML_HTMLorMML";
document.getElementsByTagName("head")[0].appendChild(script);
})();
</script>
</body>
</html>