Accessing Data in the Cloud
Using SAS to read data from Amazon Simple Storage Service (S3)
seleritysas.com
Accessing Data in the Cloud Using SAS to read data from Amazon - - PowerPoint PPT Presentation
Accessing Data in the Cloud Using SAS to read data from Amazon Simple Storage Service (S3) seleritysas.com What is Amazon Simple Storage Service (S3)? An object store, not a file system Write once, read many (WORM) Eventually
Using SAS to read data from Amazon Simple Storage Service (S3)
seleritysas.com
public-datasets
ecord_data.shtml
✓ Easy to implement ✗ File is retrieved using the http protocol (serially) ✗ The slowest of all options, subject to timeouts for very large files
✓ Very fast, as it uses parallel downloads ✗ Only available from 9.4M4 ✗ Only works with secure S3 files, not public S3 files
✓ Very fast, as it uses parallel downloads ✗ Need to install the AWS CLI on the SAS Server ✗ Need the ability to run X commands on the SAS Server
✓ Treat it like a local disk ✗ S3 is not designed for block storage/access ✗ Potential issues with current storage driver implementations
NOTE: The data set WORK.YELLOW_TRIPDATA_2017_01 has 9710124 observations and 17 variables. real time 36.09 seconds cpu time 33.85 seconds
NOTE: PROCEDURE S3 used (Total process time): real time 3.77 seconds cpu time 6.31 seconds NOTE: PROCEDURE IMPORT used (Total process time): real time 26.75 seconds cpu time 26.75 seconds
NOTE: DATA statement used (Total process time): real time 5.80 seconds cpu time 0.00 seconds
NOTE: PROCEDURE IMPORT used (Total process time): real time 26.59 seconds cpu time 26.59 seconds
Contact michael@selerity.com.au 1300 727 757
seleritysas.com