Commit Graph

87 Commits (053f30db932d9b4a9a8956fb4a4ebe29cfd14a45)

Author SHA1 Message Date
Raphaël Vinot 4deb73d245 Use own version of officedissector. 2016-05-09 17:38:32 +02:00
Raphaël Vinot 51615f8887 Handle invalid docx properly 2016-02-01 14:19:51 +01:00
Raphaël Vinot e8de330d34 Proper handling of OOXML docs 2016-02-01 12:34:47 +01:00
Raphaël Vinot 34e7075609 Merge pull request #2 from Dymaxion00/master: Initial working version of EXIF splitting and image format validation… 2015-12-21 00:31:39 +01:00
Eleanor Saitta 53e4570356 Switch back to exifread; PIL's EXIF support sucks. 2015-12-16 16:12:27 -05:00
Eleanor Saitta 53b61d487e Move to PIL for EXIF; add PNG metadata extractor; modularize metadata extraction. Switch back to exifread; PIL's EXIF support sucks. 2015-12-16 16:09:57 -05:00
Raphaël Vinot ecfdeb7b79 Add missing '.' 2015-12-15 10:46:11 +01:00
Eleanor Saitta ca90a08159 Initial working version of EXIF splitting and image format validation by round-trip conversion. 2015-12-10 00:06:36 -05:00
Raphaël Vinot 6bc83f947d Improve readme 2015-11-24 18:03:51 +01:00
Raphaël Vinot 936fc2c2a2 Proper handling of symlinks 2015-11-24 17:45:06 +01:00
Raphaël Vinot f2233aeae1 Improve doc, use trusty in travis. 2015-11-24 15:03:57 +01:00
Raphaël Vinot f44aedac17 Print FS tree for unpacked archives 2015-11-24 11:41:45 +01:00
Raphaël Vinot daec0cd689 Add forbidden extensions 2015-11-24 11:40:56 +01:00
Raphaël Vinot 1a2637b252 Use default python-magic, escape filenames 2015-11-05 16:27:48 +01:00
Raphaël Vinot 03f1d90f33 Code de-duplication 2015-11-05 15:34:22 +01:00
Raphaël Vinot b0d0912ff9 Skip the known extension check if mimetypes fails. 2015-11-05 10:34:03 +01:00
Raphaël Vinot 9079eac90a try to fix magic 2015-11-05 08:57:24 +01:00
Raphaël Vinot 531ab43dae Improve debug, add list of malicious ext 2015-11-05 00:10:30 +01:00
Raphaël Vinot 2669e80ca9 Unpack all archives, debug invalid mimetype 2015-11-03 17:56:42 +01:00
Raphaël Vinot c122ef9db8 Better support of ODF 2015-11-03 15:30:59 +01:00
Raphaël Vinot 5f080e7323 fix call pdfid 2015-11-03 13:04:14 +01:00
Raphaël Vinot d1f1c4fe16 Add new file to travis 2015-11-03 11:12:29 +01:00
Raphaël Vinot cb38f004e1 Initial version of the script to do sanity checks on files, in (pure) Python 2015-11-02 18:00:40 +01:00
Raphaël Vinot 7f15b60539 Avoid error on unknown variable 2015-10-27 14:45:12 +01:00
Raphaël Vinot 74fe05cbe1 Do not use subprocess. 2015-10-27 10:24:45 +01:00
Raphaël Vinot 5d848f4787 Add script for specific purposes, add testcase 2015-10-26 17:11:36 +01:00
Raphaël Vinot dc098dd9a8 Make GS conversion safer 2015-06-17 16:50:20 +02:00
Raphaël Vinot a678a1c9f7 Force path to PDFA_def.ps 2015-06-02 16:44:57 +02:00
Raphaël Vinot 84b004c8a9 Better support of PDF/PS docs 2015-05-31 15:36:36 +02:00
Raphaël Vinot fb7e47b10e Fix bug with media processing 2015-05-29 18:00:48 +02:00
Raphaël Vinot 32d70efe29 Merge branch 'master' of github.com:CIRCL/PyCIRCLean 2015-05-29 17:35:14 +02:00
Raphaël Vinot 3b759eb9ab Fix typo, force overwrite on extract 2015-05-29 17:34:55 +02:00
Raphaël Vinot 420e87cbba Do not process a file that has been marked as dangerous. 2015-05-26 18:56:18 +02:00
Raphaël Vinot dcc3c7eda8 WIP: Start unoconv as a listener. 2015-05-26 18:08:57 +02:00
Raphaël Vinot 5d419f711a Python 3 support, run libreoffice headless. 2015-05-18 01:34:41 +02:00
Raphaël Vinot ac372dc59d Fix completely buggy mimetype/extension xcheck 2015-05-17 15:58:59 +02:00
Raphaël Vinot e9d76adb42 Initial commit 2015-05-11 14:32:59 +02:00