google

A module to search and scrape google. This is not sponsored, supported, or affiliated with Google Inc.

Node.js - google

This module allows you to search google by scraping the results. It does NOT use the Google Search API. PLEASE DO NOT ABUSE THIS. The intent of using this is convenience vs the cruft that exists in the Google Search API.

This is not sponsored, supported, or affiliated with Google Inc.

Please do not post an issue, email me, tweet me, or in anyway contact me about getting around Google blocking your automated search requests. These sorts of requests are outside the scope of this module. Google has every right to block consumers of their service for any reason. See: #27, #20.

npm install --save google

This prints out the first 50 search results of the query node.js best practices.

var google = require('google')
 
google.resultsPerPage = 25
var nextCounter = 0
 
google('node.js best practices', function (errnextlinks){
  if (err) console.error(err)
 
  for (var i = 0; i < links.length; ++i) {
    console.log(links[i].title + ' - ' + links[i].link) // link.href is an alias for link.link 
    console.log(links[i].description + "\n")
  }
 
  if (nextCounter < 4) {
    nextCounter += 1
    if (next) next()
  }
})

You can specify results in a specific timeframe. Working values listed below:

var google = require('google')
 
// assign one of the values below. Nothing is set by default. 
google.timeSpan = 'h' // information indexed in the past hour 
google.timeSpan = 'd' // information indexed in the past day 
google.timeSpan = 'w' // information indexed in the past week 
google.timeSpan = 'm' // information indexed in the past month 
google.timeSpan = 'y' // information indexed in the past year 

You can also specify the TLD of the Google search page and the language. If you change the language you must translate the next page results text to detect the corresponding link.

var google = require('google')
 
google.lang = 'de'
google.tld = 'de'
google.nextText = 'Weiter'
 
google('node.js best practices', function (errnextlinks){
  …
})

You can specify the options to be passed to request, see the request module for all available options.

var google = require('google')
 
google.requestOptions = {
  proxy: 'http://user:password@192.168.5.4:80',
  timeout: 30000,
  localAddress: '127.0.0.1',
  jar: true,
  headers: {
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
    'Accept-Encoding': 'gzip, deflate',
    'Accept-Language': 'en;q=0.5',
    'Cache-Control': 'max-age=0',
    'Connection': 'keep-alive',
    'DNT': 1
  }
}
 
google('node.js best practices', function (errnextlinks){
  …
})

Licensed under MIT. See LICENSE for more details.

Copyright (c) 2012-2015 JP Richardson