gmail-mbox-stats


gmail-mbox-stats is a very simple tool to analyze your Gmail mailbox.

  • Find the sender who sent the most mails.
  • Find the receiver to whom you sent the most mails.

  • Find the domain (@gmail.com, @live.com ...) that appears most often in sender addresses.
  • Find the receiver to whom other senders (anyone but you) sent the most mails.

  • Find the address you most often place in CC.
  • Find the address others most often place in CC.

  • Find the address you most often place in BCC.
  • Find the address others most often place in BCC.

  • And more.
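
All of the questions above come down to counting how often each address or domain occurs and then picking the largest count. Here is a minimal sketch of that idea. It is not the tool's actual code: the helper names countBy, domainOf and topEntry are hypothetical, and the sender addresses are made up.

```javascript
// Hypothetical helpers that illustrate frequency counting.
// They are not taken from gmail-mbox-stats.

// Count how many times each key occurs in a list of items.
function countBy(items, keyFn) {
  const counts = new Map();
  for (const item of items) {
    const key = keyFn(item);
    counts.set(key, (counts.get(key) ?? 0) + 1);
  }
  return counts;
}

// Extract the domain part of an address, e.g. "a@gmail.com" -> "@gmail.com".
function domainOf(address) {
  const at = address.lastIndexOf("@");
  return at === -1 ? "" : address.slice(at).toLowerCase();
}

// Return the [key, count] pair with the highest count, or null for an empty map.
function topEntry(counts) {
  let best = null;
  for (const entry of counts) {
    if (best === null || entry[1] > best[1]) best = entry;
  }
  return best;
}

// Made-up sender addresses, one per message.
const senders = [
  "news@example.com",
  "bob@gmail.com",
  "news@example.com",
  "amy@gmail.com",
  "cat@gmail.com",
];

const bySender = countBy(senders, (s) => s.toLowerCase());
const byDomain = countBy(senders, domainOf);

console.log(topEntry(bySender)); // [ 'news@example.com', 2 ]
console.log(topEntry(byDomain)); // [ '@gmail.com', 3 ]
```

Note that the most frequent sender and the most frequent domain can differ, as they do in this sample.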


Video instruction:

Coming soon...



Textual instruction:

  • Download your Gmail data from Google Takeout. Preferably select 'Include all messages in Mail' so that the export contains all mail, not just Inbox or just Sent/Spam/Archive/Trash. If your mailbox has 100K mails, the downloaded data can be 10 GB or more, so be ready to deal with a large file. If the export is too large, it may arrive not as a single archive file but as a multipart archive, such as a split ZIP.

  • Extract the MBOX file from the Gmail data file(s).

  • Make sure you have installed Node.js. It is available for Windows, Mac and Linux.

  • Open a terminal (preferably in the same folder where the MBOX file is located) on Windows/Mac/Linux. On Windows, the terminal should be PowerShell, not CMD.

  • Run a command with this syntax:
    npx gmail-mbox-stats mymail="<your email address>" mboxpath="<mbox file path>"

    For example:
    npx gmail-mbox-stats mymail="leodevbro@gmail.com" mboxpath="./All mail Including Spam and Trash.mbox"
    The notation ./ tells the tool to look for the file All mail Including Spam and Trash.mbox in the terminal's current folder.
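
To show why the quotes matter in the command above, here is a minimal sketch of how key="value" arguments could be read by a Node.js program. It is only an illustration: parseKeyValueArgs is a hypothetical helper, not the tool's real parser, and the array stands in for what the program would typically receive after the shell has processed the quotes.

```javascript
// Hypothetical sketch of parsing key=value arguments.
// This is not the real parser of gmail-mbox-stats.
function parseKeyValueArgs(argv) {
  const result = {};
  for (const arg of argv) {
    const eq = arg.indexOf("=");
    if (eq <= 0) continue; // skip arguments that have no key before "="
    const key = arg.slice(0, eq);
    const value = arg.slice(eq + 1); // everything after the first "="
    result[key] = value;
  }
  return result;
}

// A shell such as Bash removes the double quotes and passes each quoted
// value as part of a single argument, so a path with spaces stays in one piece.
const exampleArgv = [
  "mymail=leodevbro@gmail.com",
  "mboxpath=./All mail Including Spam and Trash.mbox",
];

const args = parseKeyValueArgs(exampleArgv);
console.log(args.mymail);   // leodevbro@gmail.com
console.log(args.mboxpath); // ./All mail Including Spam and Trash.mbox
```

Without the quotes, the spaces in the file name would split the path into several separate arguments.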









That's it.
Now just see the results:








It will probably take 5 to 15 seconds to analyze 1,000 mails (messages),
about 100 seconds for 10K mails,
about 1,000 seconds (roughly 10 to 20 minutes) for 100K mails, and so on.
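
The figures above grow roughly linearly, at about 100 seconds per 10K mails. As a rough sketch, assuming that rate holds for your machine (it may not), you can estimate the waiting time like this. The function names are hypothetical.

```javascript
// Rough linear estimate based on the figure quoted above:
// about 100 seconds per 10,000 mails. Real speed depends on the machine.
const SECONDS_PER_10K_MAILS = 100;

function estimateSeconds(mailCount) {
  return (mailCount / 10000) * SECONDS_PER_10K_MAILS;
}

// Format a number of seconds as "M min S s", rounding to whole seconds first.
function formatMinutes(seconds) {
  const total = Math.round(seconds);
  return `${Math.floor(total / 60)} min ${total % 60} s`;
}

console.log(formatMinutes(estimateSeconds(100000))); // 16 min 40 s
```

Treat the result as a ballpark only. The example run shown further down (14577 messages in about 1.5 minutes) was faster than this rate.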

  • When it finishes, the terminal will log basic information like this:
Success.
Full count of messages: 14577
Messages where sender is me: 425
Messages where sender is not me: 14147
Messages where sender is hidden: 2
Messages where sender is empty: 3

Created new folder "mailStats_2024-04-14_23-50-21"


Start datetime: 2024-04-14_23-50-21
->End datetime: 2024-04-14_23-51-54

Full Execution Time: 1:32.222 (m:ss.mmm)


gmail-mbox-stats v1.0.10
Created by leodevbro (Levan Katsadze)
* linkedin.com/in/leodevbro
* github.com/leodevbro
* facebook.com/leodevbro

If you feel like donating
* buymeacoffee.com/leodevbro
* ko-fi.com/leodevbro
  • Also, a new folder will be created in the same folder where the MBOX file is located.
    Its name is "mailStats" followed by the execution start datetime,
    like this: mailStats_2024-04-14_23-50-21

  • In the 'mailStats' folder, there will be a generalStats.csv file. If you import it into Google Sheets, it will look like this:


Example generalStats CSV In Google Sheets


  • In the 'mailStats' folder, there will also be two folders:
    forMailsWhereSenderIsMe - the stats for only the mails where the sender is you.
    forMailsWhereSenderIsNotMeOrIsUnknown - the stats for only the mails where the sender is not you, or the sender is unknown.
    Both folders contain .csv files of stats. You can import them one by one into Google Sheets.
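
The split into the two folders can be pictured with a small sketch. This is an assumption about the general idea, not the tool's real logic: the message shape ({ from: ... }), the case-insensitive comparison and the helper name groupFor are all hypothetical.

```javascript
// Hypothetical sketch: decide which of the two stats folders a message belongs to.
// The message shape and the case-insensitive comparison are assumptions.
function groupFor(message, myMail) {
  const sender = (message.from ?? "").trim().toLowerCase();
  const me = myMail.trim().toLowerCase();
  return sender !== "" && sender === me
    ? "forMailsWhereSenderIsMe"
    : "forMailsWhereSenderIsNotMeOrIsUnknown";
}

const myMail = "me@example.com";
const messages = [
  { from: "me@example.com" },
  { from: "Friend@Example.org" },
  { from: null }, // sender missing: counted as unknown
];

// Count how many messages land in each group.
const totals = {};
for (const message of messages) {
  const group = groupFor(message, myMail);
  totals[group] = (totals[group] ?? 0) + 1;
}

console.log(totals);
// { forMailsWhereSenderIsMe: 1, forMailsWhereSenderIsNotMeOrIsUnknown: 2 }
```

Every message falls into exactly one of the two groups, so the two counts always add up to the total number of messages.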

Here is what the full folder structure looks like:

▨All mail Including Spam and Trash.mbox

📂mailStats_2024-04-14_23-50-21
    ▦generalStats.csv

    📂forMailsWhereSenderIsMe
        ▦me_freqBcc.csv
        ▦me_FreqCc.csv
        ▦me_FreqReceiver.csv --- Here you can find the receiver to whom you sent the most mails
        ▦me_FreqSender.csv
        ▦me_freqSenderDomain.csv
        ▦me_freqSenderPlusName.csv

    📂forMailsWhereSenderIsNotMeOrIsUnknown
        ▦notMeOrUnkn_freqBcc.csv
        ▦notMeOrUnkn_FreqCc.csv
        ▦notMeOrUnkn_FreqReceiver.csv
        ▦notMeOrUnkn_FreqSender.csv --- Here you can find the sender who sent the most mails
        ▦notMeOrUnkn_freqSenderDomain.csv
        ▦notMeOrUnkn_freqSenderPlusName.csv
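
The timestamp in the folder name above follows the pattern YYYY-MM-DD_HH-mm-ss. As a minimal sketch, a name in that pattern could be built from a JavaScript Date like this. The helper statsFolderName is hypothetical, and the use of local time is an assumption.

```javascript
// Hypothetical helper that formats a Date like the folder name shown above
// (mailStats_YYYY-MM-DD_HH-mm-ss). Using local time is an assumption.
function statsFolderName(date) {
  const pad = (n) => String(n).padStart(2, "0");
  const day = [date.getFullYear(), pad(date.getMonth() + 1), pad(date.getDate())].join("-");
  const time = [pad(date.getHours()), pad(date.getMinutes()), pad(date.getSeconds())].join("-");
  return `mailStats_${day}_${time}`;
}

// Months are zero-based in the Date constructor, so 3 means April.
console.log(statsFolderName(new Date(2024, 3, 14, 23, 50, 21)));
// mailStats_2024-04-14_23-50-21
```

Because every part is zero-padded and ordered from year down to second, folders from several runs sort chronologically when sorted by name.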


Now, for example, let's import the file notMeOrUnkn_freqSender.csv into Google Sheets:

exampleCsv__notMeOrUnkn_freqSender In Google Sheets

Also, some other files:

me_freqReceiver.csv

exampleCsv__me_freqReceiver In Google Sheets


notMeOrUnkn_freqSenderDomain.csv

exampleCsv__notMeOrUnkn_freqSenderDomain In Google Sheets


notMeOrUnkn_freqSenderPlusName.csv

exampleCsv__notMeOrUnkn_freqSenderPlusName In Google Sheets


notMeOrUnkn_freqReceiver.csv

exampleCsv__notMeOrUnkn_freqReceiver In Google Sheets


notMeOrUnkn_freqCc.csv

exampleCsv__notMeOrUnkn_freqCc In Google Sheets
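
The file name freqSenderPlusName suggests that it counts senders together with their display names, although that reading is a guess. Under that assumption, here is a minimal sketch of splitting a "From" header value into a name and an address. The helper splitSender and the sample values are hypothetical, and real headers can be more complex than this pattern handles.

```javascript
// Hypothetical sketch: split a "From" header value such as
// '"Shop News" <news@example.com>' into a display name and an address.
// Real-world headers can be more complex than this simple pattern handles.
function splitSender(headerValue) {
  const match = headerValue.match(/^\s*"?([^"<]*?)"?\s*<([^<>]+)>\s*$/);
  if (match) {
    return { name: match[1].trim(), address: match[2].trim().toLowerCase() };
  }
  // No angle brackets: treat the whole value as a bare address.
  return { name: "", address: headerValue.trim().toLowerCase() };
}

console.log(splitSender('"Shop News" <news@example.com>'));
// { name: 'Shop News', address: 'news@example.com' }
console.log(splitSender("Bob Smith <Bob@Gmail.com>"));
// { name: 'Bob Smith', address: 'bob@gmail.com' }
console.log(splitSender("amy@gmail.com"));
// { name: '', address: 'amy@gmail.com' }
```

The same address can arrive under several display names, which is one reason a per-address count and a per-name count can give different rankings.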



Thank you.

My name is Levan Katsadze (ლევან კაცაძე), 1995-03-03, from Tbilisi, Georgia (Not USA).


If you feel like donating:

Buy Me A Coffee ko-fi



