README
Turntable
A Node.js module, ideal for a chron, that will download data from a Google Spreadsheet and put select columns on an Amazon S3 bucket. To get your spreadsheet key, do File > Publish to the Web
in Google Spreadsheets.
You'll want to create an AWS credentials.json
file per these instructions and put it somewhere like ~/.aws/credentials.json
and type that path into path
in config/aws-info.json
.
Tested on Node 0.10.7
Installation
npm install gdoc-to-s3
Usage
See example.js
Features
- Only uploads the columns you specify in
output_schema
in thegdoc_info
object in case there are fields you use internally that aren't meant for production. For instance, you might have an "Edited by" or "Written by" column that you want to keep in your document but don't need to show publicly. - Uploads two copies of your data: 1) the production copy that gets overwritten each time with new data; 2) a timestamped copy that goes into the
backups
directory. The default directory isbackups
in the same directory as youroutput_path
. You can set your own backup directory in theaws-info
object. With backups, you can easily revert to an old version if necessary.
TODO
- Tests
Twitter updates
You can optionally set up a Twitter bot to deliver notifications by setting use_twitter_bot
to true
in the tweetbot_info
object. This can be used mostly likely on a private account for easy team notifications. Setting @-replies for errors could be an effective notification systems. Successes needn't be so noisy.
A note on data privacy
As long as you don't share you key with anyone, publishing to the web doesn't alter your sharing and security preferences for that doc. If there are columns that you don't want visible in your csv on S3. There are two options:
- If you're okay with that data being accessible if someone know the spreadsheet key, then, in the script, you can specify which columns it will copy over to S3 by naming them in the
output_schema
in thegdoc-info
object. - If you want more security, create a second sheet with a formula like
=Sheet1!A:A
in Column A,=Sheet1!B:B
in Column B and so on. If you copy that formula down, it will take the values from Sheet1 only for the columns you specify. The downside: if you don't copy the formula in Sheet 2 to enough rows, then it won't carry over the data. So you have to keep an eye on it and make sure your formula is in all rows. You'll want to overwrite the ajax url to make sure it grabs the proper worksheet.
Dependencies
AWS SDK
You can install with
npm install aws-sdk
DSV for CSV parsing (extracted from D3.js)
You can install with
npm install dsv
jQuery for Node
You can install with
npm install jquery
Optional
Twitter Client for Node
You can install with
npm install twit