Is there any documentation for installing impala from source (I have a customized codebase of impala) to a real cluster instead of the minicluster scripts included with the source code?
Given that the cluster already has CDH with HBase installed on it with cloudera manager.
Related
I'm trying to connect RStudio on a compute instance to the Oracle autonomous warehouse.
I've covered the following steps:
Provisioned Autonomous Data Warehouse
(https://docs.oracle.com/en/cloud/paas/autonomous-data-warehouse-cloud/tutorial-provision-adw/)
Stored the DB connection Wallet;
Installed Oracle Instant client basic package: https://www.oracle.com/technetwork/topics/winx64soft-089540.html
Downloaded and placed in the same directory as the instant client: SDK package & ODBC package
included Instant package path in environment veriable PATH ("C:/oracle_client/instantclient_18_5") and create OCI_LIB64 system variable with the same path
installed GTools checking the PATH options
Extracted the wallet to: C:\oracle_client\instantclient_18_5\network\admin
Now I'm experience an inssue installing the ROracle package within RStudio.
I'm using the following script:
Sys.setenv(OCI_LIB64="C:/oracle_client/instantclient_18_5")
install.packages("ROracle")
The error I got is the include headers have not been found:
Warning in strptime(xx, f, tz = tz) :
unable to identify current timezone 'C':
please set environment variable 'TZ'
* installing *source* package 'ROracle' ...
** package 'ROracle' successfully unpacked and MD5 sums checked
** using staged installation
Oracle Client Shared Library 64-bit - 18.5.0.0.0 Operating in Instant Client mode.
found Instant Client C:/oracle_client/instantclient_18_5
found Instant Client SDK C:/oracle_client/instantclient_18_5/sdk/include
ERROR: cannot find Oracle Client include headers in C:/oracle_client/instantclient_18_5/sdk/include.
Please set OCI_INC to correct location.
ERROR: configuration failed for package 'ROracle'
* removing 'C:/Users/opc/Documents/R/win-library/3.6/ROracle'
Warning in install.packages :
installation of package ‘ROracle’ had non-zero exit status
Everybody is saying to also set OCI_INC like this (but I do not have a directory in the manner "...client_1/oci/include/:
Sys.setenv(OCI_INC="C:/Oracle/product/11.2.0/client_1/oci/include")
Any idea what am I missing for the ROracle or in general?
R version: 3.6
Operating system: Windows Server on a compute instance
According to the documentation (https://docs.oracle.com/en/database/oracle/r-enterprise/1.5.1/oread/installing-oracle-database-instant-client.html#GUID-8C502BDD-C1A1-4EA0-9E88-D2F8452D4AC2) you need to download "Instant Client Package - SDK". When I checked the webpage for instant client the links for the SDK version were absent.
However,if you go here: https://www.oracle.com/database/technologies/instant-client/downloads.html and select you O/S on the following page there should be a link for the SDK version. For example, I use a Mac and the correct download is here:https://download.oracle.com/otn_software/mac/instantclient/193000/instantclient-sdk-macos.x64-19.3.0.0.0dbru.zip
Once I unzipped that file, the directory structure had an .../include folder with .h files that I think the installer needs to complete. Therefore, the correct settings on my Mac would be:
OCI_LIB64 set to /Users/keithlaker/Downloads/instantclient_19_3
OCI_INC set to /Users/keithlaker/Downloads/instantclient_19_3/sdk/include
One thing to note...I am not familiar with R but in broad general terms Oracle's in-database implementation of R (in-database R script repository and R object Datastore) that is part of the now free EE "Advanced Analytics" option is not part of Autonomous Database.
Therefore, your R project should be able to connect but you will not be able to use any of the usual in-database R packages because they don't exist.
Hope this helps.
Just found a thread on our internal forum which is worth sharing. The above might allow you to install the R client but ADB (ADW/ATP) does not support the use of ore.connect() and ore.disconnect(). Support for R is on our product map, see this presentation from OpenWorld 2019 - https://static.rainfocus.com/oracle/oow19/sess/1553766982670001yXnT/PF/CON4719%20OOW%20SF%202019%20-%20Hornick%20-%20OML%20Overview%20and%20Roadmap%20v1.4_1568740954169001k3Df.pdf
For the latest versions of SparkR (1.6 and 2.0) do I need R to be installed ? If I need R to run sparkR is it required to install R on all data nodes in a Yarn cluster with CDH and spark?
Yes, you need:
R installation.
all packages and native dependencies
installed on each worker node.
When I try to install a previous version of Cloudera with all packages(Cloudera Manager, CDH with all applications and so forth) through install manager binary always installs the lastest version of Cloudera (now 5.8) even if the manager installer bin was found in the Cloudera 5.4 directory in the repository. I want to install Cloudera 5.4. For that, I have found Cloudera 5.4 RPMs in the repository to install manually in RHEL but when I install them, yum finds dependency errors with the database and daemon packages perhaps because of a previous failure in the installation. Because of this I think this is not the best manner of install Cloudera 5.4. Anyone that knows how to install a previous version of Cloudera or have installed or have experience in that or can give me some advise on how to install it? Thanks!
While, I'm not clear on why you would want an older version of CM, you can adjust your cloudera-manager.repo file to point to the specific version you want. Something like this:
[cloudera-manager]
name = Cloudera Manager, Version 5.4.10
baseurl = https://archive.cloudera.com/cm5/redhat/6/x86_64/cm/5.4.10/
gpgkey = https://archive.cloudera.com/redhat/cdh/RPM-GPG-KEY-cloudera
gpgcheck = 1
I am trying to install teradata utilities 15.x,
I earlier installed teradata ODBC drive and .NET Data Provider for teradata.
Now if I start the installation, I get an error saying:
"The installation failed for the following reason:
A TTU 14.0 suite is installed. Please uninstall all TTU 14.10 suites before installing TTU 15.00"
I tried to uninstall the suite using uninstall_TTU.vbs, but I get below error:
TTUSuiteSilent.exe was not found in .\TTU directory.
This script needs TTUSuiteSilent.exe to remove suite packages.
Aborting uninstallation....
Any advise on how to get this uninstalled?
The Teradata ODBC driver typically requires three components to be installed on the client:
ICU Libraries
GSS Libraries
ODBC Driver
You must ensure that all three of these have been removed before you can replace them with another version. Typically, they can be found in the Add/Remove Programs applet of the Windows Control Panel.
I had install CDH 4.2.1 manually without using cloudera manager using the installation guide on http://www.cloudera.com/content/cloudera-content/cloudera-docs/CDH4/4.2.0/PDF/CDH4-Installation-Guide.pdf site.
Now i want to upgrade the version from 4.2.1 to 4.3 without using cloudera manager.Can anyone guide me to do this.
First of all, Cloudera Manager makes CDH installation and upgrades so much easier. It's worth installing for that reason alone (and it's included in the free Cloudera Standard package).
As you've already done a manual install however, a manual upgrade is the only realistic option -- this doc explains how.
For upgrading CDH 4.2.1 to CDH 4.3 manually , you need to consider updating the different services running on cluster.
The parcels or packages you have downloaded while installations is of 4.2.1. In other words we can say these supports only 4.2.1 , so need to download rpm or packages of different services install it manually in your system. I have updated my system from CDH 3 to CDH 4.8.
It's running in my case . Enjoy :)