All talks can be found here: https://resources.rstudio.com/rstudio-conf-2020. I’ve picked 30 talks in total, so will split them into 3 parts with 10 talks each.
Part 2 is here
Part 3 is here
This talk by Jenny Bryan is another gentle introduction of R community to more software-oriented practices. Specifically, how do you fix an error? It might seem like an innocuous question, but it has a lot of complexity to it. The steps you need to take are rather simple, though:
- Create reprex (don’t forget about “minimal” part of reprex!).
- Add test for future self to not introduce the same error again.
I’m particularly fond of the second step with reprex since I’m spending some time on RStudio Community and it’s constant struggle for people who are willing to spend their time to help out, but they can’t without something as simple as reprex. If you find yourself in such a situation, take a gander at reprex’s website.
In my very limited experience of building Shiny apps I’ve already came across the situation where it’s a major pain in the back to change colors. Every single time you need to fiddle with them, I would think that it’s great that I’m a data scientist and not a front-end developer. But it turns out front-end developers already solved this problem with, e.g., SASS compilers that allow changing colors in Bootstrap with setting variables and running compiler on it. This means that changing just one variable changes everything about how your app looks like in a consistent manner.
There is also a cool way to run shiny app with self-updating backend where you can change variables and see what it does for your app. All in all - very useful (if surprisingly belated) addition to already impressive range of tools that Shiny provides.
This talk is about
tune package that is yet another building block in
tidymodels family of packages (my presentation about state of that in March, 2019 - here). I’m really enjoying seeing all of the (relatively) small improvements that come out of
tidymodels team and with each new addition the workflow that they propose look more and more solid. I especially liked the
.notes column that collects all the warnings/errors/etc from the fitting of the model as it shows that authors spend a lot of time around the topic (I mean, with Max Kuhn it goes without saying, but still).
State of the union address from Hadley about how things are going and where. One interesting thing for me personally is post-mortem of tidy evaluation that, apparently, wasn’t introduced in a way conducive to learning. In order to understand how it works you’d need to learn bunch of theory (you can take a look here where I made a presentation about it - tidy eval). The tension comes from the fact that in your “normal” life you rarely need all of it, so forcing it on people was a mistake. Similarly,
purrr will become less visible since it requires learning a bit more than you need day-to-day. Personally, those two things are probably my favorite places of
tidyverse, but I can see the point that Hadley made. Also, they are not going away, just becoming a bit less in your face for newcomers.
plumber is a nifty little package that is deeply integrated into RStudio Connect. But even without it, it allows data scientists to quickly create REST API endpoint to expose whatever you are doing to a wider world. James went over couple of scenarios of such interaction together with how it could be solved using
plumber functionality. I’ve used this package in the past and even then it worked fairly well, but with newer and newer features it becomes even more user-friendly.
Entertaining presentation by creator of
rayshader package where he showed a lot of crazy stuff that you can do entirely in R. Some (most) of the examples are really cool. Obviously, it works best for maps, but in one demo Tyler showed how you can visualize Earth and Moon orbiting around each other. It probably took multiple hours to render, but result is stunning.
- We’re hitting R a million times a day so we made a talk about it by Heather Nolis and Dr. Jacqueline Nolis
Data engineer and data scientist from T-Mobile shared their experiences about how to put R into production. There are a lot of interesting tidbits, but one that I’ve enjoyed hearing about the most is the fact that they lean heavily on Shiny to actually showcase their work to stakeholders. This is something that, for example, is not as common at my current workplace, but I can definitely see how beneficial it is whenever I do manage to sneak a little bit of Shiny into the mix.
Despite the name of the talk, brunt of the presentation was about new package from RStudio called
pins is a way to share R objects between multiple people. That’s basically it. For data sharing it’s probably better to use DB’s, but if you want to share, e.g., recipe from
recipes package, then
pins can be handy.
renv is a spiritual successor to
packrat. Just like tidy evaluation above,
packrat was a bit ahead of its time since it did provide a way to handle dependencies in R projects, but it did so in a rather convoluted way.
renv is a new version that supposed to fix most mistakes
packrat did. From multiple unrelated people I’ve already heard that
renv is indeed MUCH easier to use and overall delivers on this promise. So, good job Kevin :).
arrow is another cool project that tries to create “modern data frame”. It does so with bindings in multiple languages and often times it’s really efficient. In the latest (0.16.1 as of this writing) it no longer requires system dependencies on Linux and I just tried and it installed without any issue. Like
arrow is far from done, but it is cool to see steady incremental progress that comes out of Ursa Labs.