Unable to configure rHive for HDP2.4 Sandbox - r

Problem
I have hortonworks sandbox 2.4 on virtual box. I am following the tutorial found here for installing rHive on the sandbox. I am unable to duplicate steps 4 and 6 without producing errors. They are as follows:
Step 4 Error: Being in the ~/RHive/ directory and using ant build gives me the following error:
BUILD FAILD /root/RHive/build.xml:39: /root/RHive/usr/hdp/current/hive-server2/lib does not exist
Step 6 Error: Using R CMD INSTALL RHive_2.0-0.10.tar.gz produces the following error:
ERROR: dependencies "rJava", "Rserve" are not available for package "RHive"
Attempts
I have followed the directions as specified on here as well, which is the Rhive documents, but unfortunately have gotten nowhere.
Step 4: I am at a loss as to why ant build could be failing. I have verified I installed it correctly and it states Apache Ant(TM) version 1.9.7 compiled April 9 2016 when I run command ant -version. So I followed those procedures correctly.
Step 6:
I used rStudio to install rJava and Rserve using install.package() command. Indeed, the tutorial suggests this as well. I suspect something is wrong with my java dependencies?
I have used Ambari to use Hive before, but this is the first time I am trying to use it in R and I am abviously still new to the hortonworks vm, so I would appreciate any kindness and assistance to help me fix the issues I am encountering.

I think the answer is to abandon rHive. It was yanked from CRAN. Anyone thinking about using it with R and finds this post, please consider RJDBC.

Related

Installing Julia Packages via GitHub

To preface this question I am coming from the perspective of an absolute beginner, trying to learn Julia. I was recommended to try the SciML tutorial and in trying to install it have fallen at the first hurdle.
So far I have:
Installed Julia
Installed Anaconda
Installed Jupyter Notebook
Added "git" through the Anaconda console
Added "IJulia"
From here I have been able to open Jupyter through Anaconda and access Julia 1.7.3 within it. Following this I attempted to follow the instructions on the SciML website for getting started with the tutorial, which is to enter this code:
using Pkg
pkg"add https://github.com/SciML/SciMLTutorials.jl"
using SciMLTutorials
SciMLTutorials.open_notebooks()
However, the second line is throwing this error
invalid git HEAD (reference 'refs/heads/master' not found)
I have also tried installing the package through the Julia console, but recieve the same error.
I am at a loss of how to proceed - I can't find anything in the documentation that suggests why this may be or how to proceed - since I am so new to this I suspect I may have missed a step in my installation process, but can't figure out where that may be.

How to get geoR and sp R packages working in AWS Linux EC2 AMI

My end goal is to be able to use the geoR and sp packages in R on an AWS EC2 Linux instance. The first hurdle is getting an older version of R since 3.3 doesn't support TCL/TK (at least that I've found).
After unpacking R-3.2.5.tar.gz and following the instructions in the INSTALL file (along with these configure options), a search in the config.log shows the following for TCL and TK:
configure:32908: checking for tclConfig.sh
configure:32950: checking for tclConfig.sh in library (sub)directories
configure:33143: checking for tcl.h
conftest.c:228:17: fatal error: tcl.h: No such file or directory
#include <tcl.h>
config.status:1893: creating src/library/tcltk/DESCRIPTION
config.status:1893: creating src/library/tcltk/Makefile
config.status:1893: creating src/library/tcltk/src/Makefile
ac_cv_env_TCLTK_CPPFLAGS_set=
ac_cv_env_TCLTK_CPPFLAGS_value=
ac_cv_env_TCLTK_LIBS_set=
ac_cv_env_TCLTK_LIBS_value=
r_cv_header_tcl_h=no
TCLTK_CPPFLAGS=''
TCLTK_LIBS=''
TCL_CONFIG=''
use_tcltk='no'
R installs successfully, but running capabilities("tcltk") confirms that something went wrong, as it returns FALSE.
Following the hints here, I searched for tclConfig.sh and tkConfig.sh, and realized I hadn't installed the appropriate dependencies, namely tcl-devel, and tk-devel.
Running yum list available shows a package for tcl-devel.x86_64 but nothing for tk. I downloaded source files for TCL and TK from here, but couldn't find source files for any devel packages.
So now I'm at a loss on how to proceed. Has anyone been through this process that can shed some light on it?
UPDATE
After reading the comment below, I'm afraid I didn't do my due diligence in researching TCL/TK and R 3.3.
My very early efforts to troubleshoot the whole problem consisted of the following workflow- First, without even considering an older version of R, I ran install.packages("geoR") in 3.3.2. This resulted in installation of package 'geoR' had non-zero exit status, along with:
Error: package 'tcltk' does not have a namespace
ERROR: lazy loading failed for package 'geoR'
So, naturally, I ran install.packages("tcltk") which resulted in package 'tcltk' is not available (for R version 3.3.2)
As I mentioned before, this was very early on. I took the message literally and didn't think to look further into TCL/TK for 3.3, but instead started looking to get an earlier version of R, since I had been successfully running geoR and sp on another system that had R 3.2.5.

Installing R on Apache Zeppelin

I'm trying to install Apache Zeppelin on my old computer that runs Ubuntu. So far, I'm able to install Zeppelin very easily by cloning the latest 0.6.0 snapshot release using
git clone https://github.com/apache/incubator-zeppelin.git
cd incubator-zeppelin
mvn clean package -DskipTests
but I want to have R on Zeppelin. Supposedly, the 0.6.0 snapshot has two R interpreters, but when I run the R tutorial (the pre-made note that uses %r), I get this list of errors.
I followed several guides to try and install R as an interpreter, but each one resulted in some kind of error. I tried this instructional:
http://www.r-bloggers.com/interactive-data-science-with-r-in-apache-zeppelin-notebook/, and got a build failure on "R Interpreter". The error message was
"dependency 'evaluate' is not available for package 'rzeppelin'
* removing '/home/rebecca/Zeppelin-With-R/R/lib/rzeppelin'"
and then a bit lower down
Failed to execute goal org.codehaus.mojo:exec-maven-plugin:1.2.1:exec (default) on project zeppelin-zrinterpreter: Command execution failed. Process exited with an error: 1
I also tried this Stack Overflow guide: Anyone tried to add R interpreter onto Apache Zeppelin?, and while I was able to run incubator-zeppelin, I received an error when I used either the %spark.r or %r interpreter tags, saying both "interpreter not found" and "prefix not found". Spark doesn't work either after following the first solution, getting the same error mentioned in the second solution (the jar file not being there), and then trying the second solution.
Does anyone have a guide for installing R onto the newest version of Zeppelin? I'm very flexible in the way I can install it. I can run other operating systems onto my computer, and I also have Virtual Box installed on my other computer, which is a mac.
There is currently a bug in the latest HEAD of zeppelin that was recently introduced and prevents the R interpreter from launching cleanly
Did anyone created a Zeppelin Jira Issue for that?
For me it is working on Zeppelin branch-0.6
build Zeppelin with r profile: -DskipTests -Prthis will...
create a directory 'R' in git repo root
copy the 'zeppelin-rinterpreter*.jar' into git_repo_root/interpreter/spark
build Zeppelin with build distro profile: e.g. -DskipTests -Pbuild-distr -Pspark-1.6 -Phadoop-2.6
use zeppelin-distribution/target/zeppelin*.tar.gz for installation
ensure both 1.1 and 1.2 are present in your zeppelin installation
The error you're getting is that you need to have the R package evaluate installed. You can install this simply by launching R and typing install.packages('evaluate').
That said, your excerpt mentions the directory Zeppelin-with-R. That's my repo, which is the R interpreter in the form when it was accepted into Zeppelin. That is version 0.5.6, not 0.6.0. There is currently a bug in the latest HEAD of zeppelin that was recently introduced and prevents the R interpreter from launching cleanly. Your best bet for now is to use the one from my repo and install clean, without trying to pull-in from Zeppelin HEAD.

Failed to install R on Centos 7

I have CentOS 7 (3.10.0-123.el7.x86_64) and I seem not be able to install R and then get Rstudio server working.
This is what I did:
rpm -Uvh http://dl.fedoraproject.org/pub/epel/epel-release-latest-7.noarch.rpm
yum install R
Then I got this error:
Transaction check error:
file /usr/lib/systemd/system/blk-availability.service from install of device-mapper-7:1.02.107-5.el7_2.1.x86_64 conflicts with file from package lvm2-7:2.02.105-14.el7.x86_64
file /usr/sbin/blkdeactivate from install of device-mapper-7:1.02.107-5.el7_2.1.x86_64 conflicts with file from package lvm2-7:2.02.105-14.el7.x86_64
file /usr/share/man/man8/blkdeactivate.8.gz from install of device-mapper-7:1.02.107-5.el7_2.1.x86_64 conflicts with file from package lvm2-7:2.02.105-14.el7.x86_64
I am not sure what it is. There is nothing from whereis R because it didn't install.
If I install Rstudio Server, it would fail to start the service and said ERROR unable to find installation of R in the system
Can someone tell me how to install R properly on Centos 7?
That is not related to R. Your log shows a conflict between "device-mapper" and "lvm2", and these packages are used to operate with devices such as hard disks and partitions (before someone downvotes me, YES this is a crude simplification).
You should try to update your system with yum before installing R.
I resolved this problem by updating lvm2 firstly, then install R.
This idea was from: https://bugzilla.redhat.com/show_bug.cgi?id=1294128

RApache Configure Does Not Recognize R Installed with Shared Library

All: Apologies in advance for what I hope is an easy question. It's been many years since I've worked in a Unix(-like) environment...
I'm trying to install RApache on a web server running the RHEL5 64-bit OS. I've successfully installed Apache and confirmed it works. I've also successfully installed R (R-2.13.0) with shared library. I've confirmed that libBlas.so and libR.so are installed; location: /R/R-2.13.0/lib
However, when I try to configure RApache ( sudo ./configure --with-apache2-apxs=/usr/local/apache2/bin/ --with-R=/R/R-2.13.0/) I get the following error:
R was not built as a shared
library Either build it with one,
or use another install of R
configure: error: aborting!
I've reinstalled R twice now and have confirmed in libtool that it says:
# Whether or not to build shared
libraries. build_libtool_libs=yes
Finally, I've searched high and low for some path or flag I need to set without success. Any help would be greatly appreciated.
Thanks,
Ron
You need to compile R with the appropriate flags. At the configure stage you need to include --enable-R-shlib so that R is built as a shared library.
You may need to further consult appendix B1 of http://cran.r-project.org/doc/manuals/R-admin.pdf which describes further issues.

Resources