Releases: smalot/pdfparser

v1.1.0

16 Aug 07:07
43e436f

Maintenance and small performance boost

PDFs containing images can now be parsed with lower resource consumption, particularly memory. @Connum added an option to ignore image data (#441). Image data is still retained by default, so the option must be set manually:

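use Smalot\PdfParser\Config;
use Smalot\PdfParser\Parser;

// Image content is retained by default; turn it off to reduce memory usage.
$config = new Config();
$config->setRetainImageContent(false);
// Pass the config as the second constructor argument.
$parser = new Parser([], $config);
// $parser->parseFile (...)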

Besides that, we fixed a problem with Scrutinizer (part of our test infrastructure).

v1.0.2

21 Jun 07:50
35c8812

Bugfix release

  • Don't throw an exception if there is no base encoding defined (as of PDF 1.5 Reference Table 5.11) - #433, thanks @LucianoHanna

v1.0.1

08 Jun 06:46
b32bb7a

Bugfix release

v1.0.0

28 Apr 08:00
d4148fd

Highlights

  • Removed support for PHP 5.6 and 7.0; PHP 7.1 or newer is now required❗
  • Extended Config.php with white-space characters, allowing developers to override the regex used for white-space recognition (#411, thanks @LucianoHanna)
  • Fixed some test-infrastructure related issues (#412, #413, #414)

v0.19.0

14 Apr 07:00
1895c17

Bugfix and feature release

Features:

  • Add support for PDF 1.5 Xref stream (#400, thanks @smalot)
  • Add support for Reversed Chars instruction in BMC blocks (#402, thanks @smalot)

Fixes:

  • Encoding::__toString complies with PHP specification from now on (#407, thanks @igor-krein and others from #85)
  • Fixed the error "Call to a member function getFontSpaceLimit() on null" (#406, thanks @xfolder)
  • Consider all PDF white-space characters in object header (#405, thanks @LucianoHanna)

v0.18.2

25 Feb 08:12
b6db6aa

Maintenance release

  • Bugfix for #391 (Uncaught Error: Call to undefined method Smalot\PdfParser\Header::__toString() in /var/www/vendor/smalot/pdfparser/src/Smalot/PdfParser/Font.php) (thanks @fsmoak)
  • Addition of an alternative autoloader for non-Composer installations (#388). Based on the work of @apmuthu and others from #117.

v0.18.1

12 Jan 08:59
b47f264

Bug fix release

Fixes an infinite loop (and memory leak) that occurred when the xref table is corrupted. For more information see #377 and #372. Thanks @partulaj!

v0.18.0

30 Dec 18:04
8b8a157

🎆 Happy new year release! 🧨

A few bug fixes and improvements.

Fixes:

  • Implemented missing __toString method in Encoding.php (thanks @tomlutzenberger, #378).
  • In Header.php make sure init is only called if $element is of type Element (thanks @lukgru, #380).

Improvements:

  • Improved performance in ElementName.php (thanks @mardc21, #369)
  • Added a config object to adapt default values like font space limit (thanks @k00ni, #375). Further values may be ported in future versions.
  • Switched from Travis to GitHub Actions (thanks @j0k3r, #376)
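
The config object from #375 can be used roughly as follows. This is a minimal configuration sketch, not taken from the release itself: it assumes a Composer installation, 'document.pdf' is a placeholder file name, and -60 is an arbitrary illustrative value for the font space limit.

```php
<?php
require 'vendor/autoload.php'; // assumption: installed via Composer

use Smalot\PdfParser\Config;
use Smalot\PdfParser\Parser;

$config = new Config();
// Illustrative value only; tune it for your own documents.
$config->setFontSpaceLimit(-60);

// The config is passed as the second constructor argument.
$parser = new Parser([], $config);
$pdf = $parser->parseFile('document.pdf'); // placeholder path
echo $pdf->getText();
```

If spaces are missing or misplaced in the extracted text, adjusting this value and comparing the output of getText() is a reasonable first experiment.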

v0.17.1

30 Oct 08:15
dc1e422

Hot fix release for a problem in PdfParser\Encoding\PostScriptGlyphs.php, for instance:

Notice: Undefined offset: 67 in pdfparser\src\Smalot\PdfParser\Encoding\PostScriptGlyphs.php on line 1091

Related issues: #359, #360

v0.17.0

12 Oct 11:33
722061c

Bug fix release with a few improvements and a new composer dependency.

Highlights:

  • added symfony/polyfill-mbstring to improve PHP 8 support (#337)
  • reverted 4f4fd10 while preserving the fix for #260; this fixes #319, #322 and #334 (#342)
  • revived #257: Properly decode ANSI encodings (#349)
  • allow for line breaks when splitting xrefs for id and position, fixes #19 (#345)
  • Document::getPages() should only ever return elements of type 'Page' (#350)
  • rely on getTextArray() in getDataTm() to extract the texts (#340)
  • fix missing BT command before each section (could result in wrong coordinates) and its resetting of Tm (#341)