Autonomous Database in the Oracle Cloud; Can I run R Scripts Natively Like with Oracle Enterprise? - r

In the past few months Oracle has included an Autonomous Container Database option in the Cloud. I was wondering if it allows for setup and configuration of running R scripts in the Db Engine like how once could previously with Oracle Enterprise Database?

Yes! This is possible since late August 2022: https://blogs.oracle.com/post/oracle-machine-learning-for-r-now-available-for-oracle-autonomous-database

Related

Flyway compatibility with Amazon Aurora

We use Flyway with our AWS RDS MySQL and Redshift instances, and it is working great! We are considering switching from RDS MySQL to RDS Aurora. Amazon says that Aurora is MySQL compatible, but wanted to know if there are any known incompatibilities between Aurora and Flyway?
Based on the AWS Aurora product details:
Amazon Aurora is designed to be compatible with MySQL 5.6, so that existing MySQL applications and tools can run without requiring modification.
Given that you're migrating from MySQL database, you won't even need to change the JDBC driver for the Flyway config. I have been using Flyway together with AWS Aurora for a while and haven't noticed any issues.
I am not familiar with flyway, but here is main restrictions to use Aurora:
Aurora supports only InnoDB storage engine (MyISAM not supported), so your database should be compatible with it. For example, it means that maximum row size limited to 16Kb.
Geo spatial indexes are not supported

Teradata SQL Assistant: Alternatives {on Windows}

Are there alternatives to Teradata SQL Assistant {on windows}, possibly freeware?
Heidi is great but it does not support Teradata.
What are the pros and cons of the alternatives?
Teradata Studio and Studio Express are the long term replacements for SQL Assistant from Teradata. They are developed around an Eclipse based IDE and are available to download from Teradata Developer Exchange.
Database .NET is a Free universal database manager for Windows All,
It supports most popular databases, including Teradata database.
Not sure if the reason is price, but the latest version of SQL Assistant is downloadable free from Teradata's website:
http://downloads.teradata.com/download/tools/teradata-studio
You don't have to install everything, just pick and choose what you want.

Using Storm in Cloudera

I have been looking to use Storm which is available with Hortonworks 2.1 installation but in order to avoid installing Hortonworks in addition to a Cloudera installation (which has Spark in it), I tried to find a way to use Storm in Cloudera.
If one can use both Storm and Spark on a single platform then it will save additional resources required to have both Cloudera and Hortonworks installations on a machine.
You can use storm with Cloudera installation. You will have to install it on your own and maintain it as such. It will not be part of the Cloudera stack but that should not stop you from using it along with Hadoop if you need it.
You can use Storm on any of the vendor platform. However, storm cluster management is something you have to consider. Storm is not part of the CDH distribution. Cloudera Manager does not manage the lifecycle of the storm services and configurations, nor does it monitor the storm cluster, unless you are willing to write a Clouderea Manager extension yourself. On the contrary, if you choose a vendor such as HDP, the Ambari management tool on HDP provides all the above management features.
If you have a streaming project on CDH, you should strongly consider Apache Spark first, as it provides the same programming model for both batch and streaming processing. You do not need to learn a new API. However, Apache Spark streaming is micro-batch. Thus in use cases that requires sub-second low latency real-time processing, Storm is more suitable.
You can use Storm alongside Cloudera.
All the above are true, but why would you?
Spark includes Spark Streaming, which allows you to handle data processing and stream/event processing workloads using a single API. Spark/Streaming is already inside CDH.
So, why burden yourself with two different APIs?
You can install Apache Storm on Cloudera VM.
For a basic setup and test run, follow below link:
https://github.com/vrmorusu/StormOnClouderaVM/wiki/Apache-Storm-on-Cloudera-VM
This should get you started on developing Storm applications on Cloudera VM.

what is the full Teradata Client software? [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Want to improve this question? Update the question so it focuses on one problem only by editing this post.
Closed 4 years ago.
Improve this question
I read couple of online documents to get start with the Teradata basics.
I am curious to know what is full client version of Teradata client software to install on Windows/unix/Linux environments? What is best suited for client install?
I see different client versions depends on the type drivers/applications
1. Teradata JDBC
2. Teradata ODBC
3. Teradata dot net provider
4. Teradata SQl assist/BTSE
5. Teradata Express
6. Teradata OLE DB provider
7. TTU .... etc
To more clarity on my Question .. DB2 and Oracle have a full client software which can install locally on windows/unix systems and it will be able to use or all type connectivity(odbc,jdbc,ole db,ado .net...etc)
Typical base toolset for end users/developers submitting ad hoc queries from each environment:
Windows (TTU 13.10 Client Install - 400MB)
Teradata SQL Assistant
Teradata .Net Provider and/or ODBC Driver
Teradata GSS & ICU Libraries
(Optional) WinBTEQ
(Optional) Teradata CLI - for WinBTEQ
Linux/UNIX/Mac OS X
Teradata JDBC Driver
Teradata Studio Express
(Optional - if using Eclipse)Teradata Plug-in for Eclipse
Many of these components can be found in the Teradata Developer's Exchange Downloads section. Starting with Teradata 12 they have reset the version numbers in an attempt to address the confusion with the different development life cycles of each component contained within their Tools and Utilities. Tools for Teradata 13.x should be compatible with Teradata 12 and Teradata 14. However, certain features specific to a given database release may not be available unless you are using a version of the client that is equal to or greater than the database version.
For ETL Development, then you will be relying upon the BTEQ, FastLoad, FastExport, MultiLoad combined with the Teradata CLI to connect to Teradata.
This answer is for Windows only.
To run queries against a Teradata database, you need the following:
Teradata SQL Assistant
Teradata .NET Data provider
Teradata ODBC Driver
To know, if your system has Teradata ODBC Driver, goto Control Panel, Administrative Tools -> Data Sources -> Add and see if Teradata is listed.
Sample screenshot below:
If you are working on new database and ( if you have license to use ), you can have a look at Teradata parallel transporter. It provides extract-load capabilities with configurable scripts.

Using TeraData for not so big applications

Few months back I saw TeraData Express Edition. I have no idea of this beast. I want to know whether it still comes with Express Edition and whether it is a good idea to use this database for Windows based mid-sized apps.
It really depends on what you want to do with this application. From an application perspective, a great weakness of Teradata is that it does not support read committed transaction isolation. If you are attempting to use Teradata as an OLTP database, then you might want to try something else. If you are using it to crunch numbers, then yes, go with it. The one issue is that Teradata Express Edition is not supported that well. Express edition is essentially a snapshot of the database for a certain release. If you find and report a bug, it will take a long time for you to receive a fix. Teradata only releases the express editions once per db release. However (imo), if you buy the real version, you will receive a pretty quick patch which will be rolled into the database software.
I use teradata in my technical support work. I work on database middleware, and Teradata is one of our supported data stores. Define mid-sized? 1-200 transactions per second? I'd stand ANY commonly used database up against that. 10000 tps? Maybe not - maybe you go to the enterprise edition.

Resources