Purpose
There is one course which requires us to do research on the methods and theories of some specific period of time. And we need to read all of the papers issued on the top journals in the target period. After briefly browsing, I found that I need to read about 100 papers which is frankly impossible to be done in just two weeks.
Then I just want to know the current citation numbers of this bunch of papers, which can make it easier to decide which one is more worth to read at first. Then I did some work on MATLAB and hope it can help you.Specific Code here
Basic info
- Version: 1.0
- Copyright: HU WEI
- Date: 2017-09-18
- Method: Use MATLAB
- Environment:
- OS: Mac OS X EI Capitan
- MATLAB: R2014b
- Default Web Browser: Chrome Version 60.0.3112.113 (Official Build) (64-bit)
- Google Scholar Language: English
Acknowledge
google2link.mis built on the basis of Fahad Al Mahmood’s work ‘google.m’, please refer to google by Fahad Al Mahmood on File Exchange- About
pdfRead.m, I should totally give credit to Derek Wood. Please refer to Read text from a PDF document by Derek Wood|File Exchange
Prerequisites:
- Make a new folder named
PDF filesand move the PDF files that you want to search for their academic citations in this folder. - Make sure that your network is open;
- This
google_citation.mcan only recognize 3 journals paper:
</i> Management Information System Quarterly
INFORMS Journal of Computing
Information System Research
Usage:
- Run
Google_Citation.mfile and you will get:
citation.matdata file which stores the title and ciataion # of theCiatationfolder which stores the renamed PDF files in the format asCitation #-Old name.pdf;
- If you want to load the data
citation.matfile to check the data, you can turn toimport_demo.mas reference;
Others
Sometimes the Google scholar need you to verify that you are not robot. Just like the error as follow:
The server returned the message: “Service Unavailable” for URL,
‘http://scholar.google.com/scholar?q=Setup+and+Open-Stacks+Minimization+in+One-Dimensional+Stock+Cutting.’
(with HTTP response code 503).
The HTTP response code 503 means that:
The server is currently unable to handle the request due to a temporary overload or scheduled maintenance, which will likely be alleviated after some delay.
You may need to try again later.
TODO
</i> Use Text Mining Techniques to recognize ‘title’, ‘author’ and some other segment automatically
Convert to Python
Build an API