deblob-check using lots of memory (was: Re: linux-22.214.171.124-libre1: suitable for Free System Distributions)
lxoliva at fsfla.org
Tue Mar 31 21:36:57 UTC 2009
On Mar 21, 2009, Rubén Rodríguez Pérez <ruben at es.gnu.org> wrote:
> It takes like five seconds per file when it works, and sed gets to use
> more than a GB of memory in that time, is that normal?
Unfortunately, it is, now. We really are building some monster regular
expressions there. I wonder if some different language, using some
different regex engine, would fare better...
But then, converting from sed regexes \(a\|b\) to some extended (a|b)
notation that enabled grouping without discarding, although cleaner,
would be kind of painful.
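
For illustration only, here is a minimal Python sketch of the mechanical part of that conversion: it swaps the escaping of the operators that GNU sed's basic syntax writes with a backslash. The function name bre_to_ere is hypothetical and is not part of deblob-check. The sketch assumes GNU-style basic regexes, copies bracket expressions through unchanged, and ignores corner cases such as a leading literal "*" or back-references, so it is a starting point rather than a complete translator.

```python
# Characters whose escaped/unescaped meaning is swapped between
# GNU basic regexes (sed default) and extended regexes.
SWAPPED = set("(){}|+?")


def bre_to_ere(pattern: str) -> str:
    """Hypothetical helper: rewrite a GNU BRE such as \\(a\\|b\\) as (a|b)."""
    out = []
    i, n = 0, len(pattern)
    while i < n:
        ch = pattern[i]
        if ch == "\\" and i + 1 < n:
            nxt = pattern[i + 1]
            # \( \) \| \{ \} \+ \? are operators in BRE; ERE spells them bare.
            # Any other escape (e.g. \.) is kept as it is.
            out.append(nxt if nxt in SWAPPED else ch + nxt)
            i += 2
        elif ch == "[":
            # Bracket expressions are literal in both syntaxes: copy verbatim.
            j = i + 1
            if j < n and pattern[j] == "^":
                j += 1
            if j < n and pattern[j] == "]":  # a leading ] is a literal member
                j += 1
            while j < n and pattern[j] != "]":
                if pattern.startswith("[:", j):  # skip classes like [:alpha:]
                    end = pattern.find(":]", j + 2)
                    j = end + 2 if end != -1 else j + 1
                else:
                    j += 1
            out.append(pattern[i:j + 1])
            i = j + 1
        elif ch in SWAPPED:
            # A bare ( or | is a literal in BRE, so it needs escaping in ERE.
            out.append("\\" + ch)
            i += 1
        else:
            out.append(ch)
            i += 1
    return "".join(out)


print(bre_to_ere(r"\(a\|b\)"))
```

The painful part mentioned above is everything this sketch leaves out: the patterns are assembled piecewise in the script, so each fragment would need converting consistently.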
I'm kind of thinking perl, in part because I've wanted look-ahead
matches for some stuff, but I'm open to suggestions. I guess playing a
bit with s2p, just to see how it goes, makes some sense...
Alexandre Oliva, freedom fighter http://FSFLA.org/~lxoliva/
You must be the change you wish to see in the world. -- Gandhi
Be Free! -- http://FSFLA.org/ FSF Latin America board member
Free Software Evangelist Red Hat Brazil Compiler Engineer