Notice: This website is an unofficial Microsoft Knowledge Base (hereinafter KB) archive and is intended to provide a reliable access to deleted content from Microsoft KB. All KB articles are owned by Microsoft Corporation. Read full disclaimer for more details.

Question: How to use SQL queries with a Revolution R xdf file


View products that this article applies to.

Problem

Is it possible to run SQL queries on an .xdf read into RevoR and running SQL inside RevoR? 

↑ Back to the top


Solution

It is not possible to run SQL on .xdf file read as it is a binary file that contains data and not a database. What allows us to work with such large data is that we can read "chunks" of the .xdf and that allows us to leverage disk resources in addition to memory resources and lets us manipulate Big data.

Options -

Since the xdf file is created by RevoR, you can perform SQL on the data when it is coming into RevoR by adding an SQL query to RxOdbcData. A sample would look like:

foo <- RxOdbcData(sqlQuery = "SELECT * FROM foo_database", 
connectionString = connectionString) 

Of course you will need to have a valid ODBC connection. The RevoScaleR ODBC Import guide has information on this.

Another option is to use rxDataStep to transform the data in the xdf into a dataframe. A dataframe must be held in memory so this may not be an option if you have extremely large .xdf files. You could then use an Open Source package called sqldf which allows you to use SQL Selects on dataframes.

↑ Back to the top


Keywords: kb

↑ Back to the top

Article Info
Article ID : 3104289
Revision : 1
Created on : 1/7/2017
Published on : 10/29/2015
Exists online : False
Views : 169