Download all the Instagram JSON data and media for a user.
By default apps created for the Instagram API are in sandbox mode, and are only allowed to download the last 20 posts. They can also only access your user account and up to 9 other user accounts that accept your application request.
npm install instagram-download
You may want to use the
--global option when installing if you want to use the CLI globally on your machine.
// Download data for a user to a directory using an API access token// Read previously downloaded data for a user from a directory
All the same options are available to the CLI.
# Download data for a user to a directory using an API tokeninstagram-download --token=TOKEN --user=USER --dir=DIR [--refresh --full]# Read previously downloaded data for a user from a directoryinstagram-download --read --user=USER --dir=DIR
The id of the Instagram user that you want to download.
The directory where you want to download all the data. It will be created if it does not exist. Inside the directory it will create a structure so that multiple users can be downloaded. See the
json section below about how the data is stored inside those directories.
dir├── USER1│ ├── json│ └── media└── USER2├── json└── media
token(string, required for
Note that as of June 1, 2016 API requests can no longer be made with using an application's client+secret (as this module did previously), and must use an access token.
refresh(boolean, optional, default
By default running
instagram-download again will start after the most recent Instagram post that was downloaded previously, so that you can easily only fetch the latest Instagram data. Use the
refresh option to overwrite all the existing Instagram data. Note that Instagram photos are never redownloaded because they should never change after being posted.
full(boolean, optional, default
By default the Instagram API only includes a few likes and comments with each post. You have the option (at the expense of two extra API requests per post) to fetch as many likes and comments as Instagram allows (which right now is ~120 each). You shouldn't hit any rate limits when using this option unless you have more than
2462 posts. See the API Rate Limiting section below for more detailed info about this.
images hash in each json file will contain the following keys from Instagram
standard_resolution. The downloader will also do some magic to attempt to download the higher resolution versions (both cropped and uncropped) if possible, and those keys will be
Once everything is downloaded you'll see the following directories:
json directory will consist of an
INSTAGRAM_POST_ID.json file for each Instagram post. The
media directory will consist of all the media (images and videos) with a directory structure that mirrors that pathnames from where they are hosted by Instagram. This is done so that if you look inside an
INSTAGRAM_POST_ID.json file, you can easily find the images files by prefixing the url host + path with the path to the media directory for that user:
$DIR/$USER_ID/media. Here's an example:
The standard resolution image for that post will be located at
_cache/1640745920/media/scontent.cdninstagram.com/hphotos-xfa1/t51.2885-15/e15/11312306_899995390069692_1338680988_n.jpg. Here is some JS for how you'd go about getting that:
const BASE = '_cache/1640745920/'const JSON_DIR = 'json/'const MEDIA_DIR = 'media/'const POST_ID = '1002451038433600709_1640745920'const post =const host path =console
Since the Instagram API now requires an access token for all requests, this module has a method for getting one.
public_content. See the Instagram API docs for other possible scopes.
instagram-download --get_token --client=ID --secret=SECRET# Or to request other scopesinstagram-download --get_token --client=ID --secret=SECRET --scope=basic --scope=likes --scope=comments
This has changed as of June 1, 2016. See the new info in the Instagram docs on rate limiting.
This module uses
debug to selectively log events. By default nothing is logged, but you can use the
DEBUG environment variable to enable logging. All debug logging events use the namespace prefix
instagram-download: with the suffixes
media. Here are some examples:
# Log everthingDEBUG=instagram-download:* instagram-download ...# Log only download eventsDEBUG=instagram-download:download instagram-download ...# Log download and api eventsDEBUG=instagram-download:download,instagram-download:api instagram-download ...
This is written in ES6 and compiled to ES5 using
babel. The code you require will come from the
lib/ directory which gets compiled from
src/ before each