7 changes: 4 additions & 3 deletions README.md
@@ -42,8 +42,10 @@ An aggregated file (metadata.csv) contains all the CSV metadata.
#### Perl
Faster and richer (more metadata) than the XSLT scripts.

-One Perl script : extractMD.pl
-For each document, its metadata are stored in the STATS folder under your prefered formats : XML, JSON, CSV, txt
+One Perl script: extractMD.pl
+One shell script: batch.sh (runs the Perl script and packages the files)
+
+For each document, metadata are stored in the STATS folder (available formats : XML, JSON, CSV, txt)


##### Test
@@ -52,7 +54,6 @@ For each document, its metadata are stored in the STATS folder under your prefer
3. >perl extractMD.pl DOCS xml json



### Charts
See on [Github](https://github.com/altomator/EN-data_mining/tree/master/Charts) and [here](http://altomator.github.io/EN-data_mining/).
