EMR bootstrap action to run Hue on MapR M3 - bigdata

Is there a bootstrap script to get Hue running on EMR with MapR, rather than setting it up manually using this guide? http://doc.mapr.com/display/MapR/Configuring+Hue

Related

I have installed Apache Airflow 2.0 but DAGs are not visible

I followed this [tutorial][1] on getting Airflow v2 set up. The setup comes with example DAGs, but unfortunately they are not visible on the DAGs page of my webserver UI.
I have tried reinstalling, but it did not help. I have also used astro start / astro stop, without any success.
My Dockerfile has only one line: FROM quay.io/astronomer/ap-airflow:latest-onbuild
I am a beginner and trying to learn more.
[1]: https://www.astronomer.io/guides/get-started-airflow-2

Airflow SparkSubmitOperator with Yarn Cluster Mode not being able to track application status

I started reading about how we could run Spark batch jobs using Airflow.
I have tried using SparkSubmitOperator in local mode and it works fine. However, I need a recommendation on whether we can use it in cluster mode.
The only problem I see with cluster mode is that the application status cannot be tracked, as described in the link below:
https://albertusk95.github.io/posts/2019/12/airflow-tracks-spark-driver-status-cluster-deployment/
Please let me know if anyone has used this operator successfully in cluster mode, or if there are any known issues with it.

Adding nodes to Cloudera Quickstart VM

I have the Cloudera QuickStart VM installed, and it is single-node. How can I add more nodes to it and turn it into a cluster? I am using VirtualBox and tried cloning the base QuickStart VM, networking the clones, and using the Add Cluster wizard in Cloudera Manager, but it fails. Does anyone know how to add multiple nodes to it?
Your quickest/easiest option (instead of using a VM) is to install the multi-node version of QuickStart for Docker:
http://blog.cloudera.com/blog/2016/08/multi-node-clusters-with-cloudera-quickstart-for-docker/
Or, you could use Vagrant to set up a virtualized multi-node cluster:
http://blog.cloudera.com/blog/2014/06/how-to-install-a-virtual-apache-hadoop-cluster-with-vagrant-and-cloudera-manager/

Running gulp as a service

We have a project where the developers (from what I understand) use gulp to run a website locally using Vagrant. They want to deploy this website on an AWS instance.
We are trying to implement the commands using Jenkins. The website stays up while gulp serve-dev is running, but then Jenkins times out and Nginx returns a 502 error. Of course we can prevent Jenkins from timing out but then the job would need to keep running.
Is there a way to run this command as a service? Is there any other way we can go about this?
You want to run gulp in the background so that it does not stop when you close the terminal or log out of the server.
Try nohup (short for "no hangup"), which runs a command with the hangup signal ignored, so the command continues running even after you log out of the server.
Run it like this: nohup gulp serve-dev &
The trailing & puts the command in the background so that you can continue using the current shell.
Cheers!

Running java based webdriver scripts in unix server

I have written WebDriver + TestNG scripts on Windows using Java. Now there is an upcoming requirement to run the same scripts on a UNIX server without much code modification. I have heard about running tests in a headless browser, but I don't know much about it. I searched a lot but found no clear and simple answer on where to start.
Is it possible to just change the driver instance to HtmlUnitDriver and run the same scripts in a UNIX environment? How could I package the tests and move them to the UNIX environment to run them?
You can run your existing script by making a few changes to how the driver instance is created. Suppose you are using the Firefox driver for your tests: you can then run your script in headless mode using the virtual display Xvfb (Xvfb is an X server that can run on machines with no display hardware and no physical input devices).
Below are the steps to run your tests in headless mode:
1. Install Xvfb.
2. Start Xvfb by executing this command (see the Xvfb manual for the options): Xvfb :99 -screen 0 1024x768x24 &
3. Then create a Firefox driver instance that uses the virtual display started in step 2, as below:
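import org.openqa.selenium.WebDriver;
import org.openqa.selenium.firefox.FirefoxBinary;
import org.openqa.selenium.firefox.FirefoxDriver;
FirefoxBinary fb = new FirefoxBinary();
fb.setEnvironmentProperty("DISPLAY", ":99"); // point Firefox at the Xvfb display :99
WebDriver driver = new FirefoxDriver(fb, null); // null = default profile (Selenium 2.x constructor)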
Now your script will run in headless mode. You may need to change a few other things, such as the path to your test data or any other references that use the Windows file system (like C:\).
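One way to deal with the Windows-specific paths is to resolve the test-data location at run time instead of hard-coding it. The following is only a minimal sketch: the helper class TestDataPaths, the system property name testdata.dir and the file names are all hypothetical, chosen for illustration, and are not part of the original scripts.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper: builds test-data paths without hard-coding C:\ locations.
final class TestDataPaths {
    private TestDataPaths() {}

    // Base directory is read from -Dtestdata.dir=... (an assumed property name).
    // If the property is not set, fall back to "testdata" under the working directory.
    static Path baseDir() {
        String configured = System.getProperty("testdata.dir");
        if (configured == null || configured.isEmpty()) {
            return Paths.get(System.getProperty("user.dir"), "testdata");
        }
        return Paths.get(configured);
    }

    // Joins the segments using the separator of the platform the JVM runs on.
    static Path resolve(String first, String... more) {
        return baseDir().resolve(Paths.get(first, more));
    }

    public static void main(String[] args) {
        // Example file name is illustrative only.
        System.out.println(resolve("login", "users.csv"));
    }
}
```

With something like this in place, the same test code could be started with -Dtestdata.dir=C:\tests\data on Windows and -Dtestdata.dir=/opt/tests/data on the UNIX server, without editing the scripts themselves.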

Resources