Text mining in R – Automatic categorization of Wikipedia articles
Text mining is currently a live issue in data analysis. Enoromus text data resourses on the Internet made it an important component of Big Data world. The potential of information hidden in the words...
View ArticlePlaying with GUIs in R with RGtk2
Sometimes when we create some nice functions which we want to show other people who don’t know R we can do two things. We can teach them R what is not easy task which also takes time or we can make GUI...
View ArticleR now will keep children away from drugs
Do you find this plot fancy? If yes, you can find the code at the end of this article BUT if you spend a little time to read it thoroughly, you can learn how to create better ones. We would like to...
View ArticleFaster, easier, and more reliable character string processing with stringi 0.3-1
A new release of the stringi package is available on CRAN (please wait a few days for Windows and OS X binary builds). # install.packages("stringi") or update.packages() library("stringi") stringi is...
View Articlestringi 0.4-1 released – fast, portable, consistent character string processing
A new release of the stringi package is available on CRAN (please wait a few days for Windows and OS X binary builds). # install.packages("stringi") or update.packages() library("stringi") Here’s a...
View ArticleInstalling Hadoop 2.6.0 on CentOS 7
by Marek Gagolewski, Maciej Bartoszuk, Anna Cena, and Jan Lasek (Rexamine). Configuring a working Hadoop 2.6.0 environment on CentOS 7 is a bit of a struggle. Here are the steps we made to set...
View ArticleUsing Hadoop Streaming API to perform a word count job in R and C++
by Marek Gagolewski, Maciej Bartoszuk, Anna Cena, and Jan Lasek (Rexamine). Introduction In a recent blog post we explained how we managed to set up a working Hadoop environment on a few CentOS7...
View ArticleSimilaR
Introduction Being a teacher can be a very gratifying job. If you teach programming, which is your favorite hobby too, nothing can be better than that. Only thing can spoil your dream: cheating...
View ArticlePull the (character) strings with stringi 0.5-2
A reliable string processing toolkit is a must-have for any data scientist. A new release of the stringi package is available on CRAN (please wait a few days for Windows and OS X binary builds). As for...
View ArticleSpeeding up R packages’ installation process
There is a time for some things, and a time for all things; a time for great things, and a time for small things — Miguel de Cervantes Building R packages from sources may take a long time, especially...
View Article
More Pages to Explore .....