A revised version of ASDF replacing the system definition :encoding support with support for reading the encoding from file options is now available at: http://www.scieneer.com/files/asdf.lisp
Changes:
* Can read the file options in a wider range of files. Also supports C, Python, XML, and probably others. It's more robust and may be of some utility in other code.
* EBCDIC support removed to save on the line count. Can add this back at a cost of about 40 lines.
* UTF-8 detection removed to save on the line count. Still detects a UTF-8 BOM, and reads a UTF-8 encoding file option. Adding a UTF-8 encoding file option will help other tools too anyway.
* Removed the lengthy external-format translation table to save on the line count. It should be easy for CL implementations to include more aliases, so the burden of maintaining the aliases is moved to the CL implementation. CLISP and the Scieneer CL already support an extensive range of codes and aliases, and an updated set of aliases has been sent for CMUCL. SBCL, ECL, and CCL all have a useful set of aliases, but could consider adding a few more. A hook is included for translating the external-formats just in case it is needed; it is a list of functions to call in turn, so multiple projects can add hooks.
* Changed the asdf.lisp coding file option to us-ascii from utf-8 - it's a highly portable tool and this should be adequate.
* This version is now down to just 114 lines more than the standard version, and the bulk of the code is just one function.
* Tested on CCL, CLISP, CMUCL, ECL, SBCL, and the Scieneer CL.
* Added a large set of test files that exercise the reading of the encoding file option and try to include enough characters from each character set to check that the encoding option has been successfully mapped to an appropriate external-format. This includes all the encodings in Linux's 'iconv -l' that support the characters needed by CL, but excludes the EBCDIC codes. All 629 tests pass on the Scieneer CL, 628 on CLISP, and a much smaller but useful set on CCL, CMUCL, ECL, and SBCL due to their limited aliases and limited encoding support. See: http://www.scieneer.com/files/coding-tests.tar.bz2
Keep in mind that even if a library author chooses an encoding that your CL implementation does not support, if they have added the encoding file option then the files can be reliably and automatically converted to something supported such as UTF-8. There may be a tool out there already in the Emacs or Python community to do this, or something can be written and could run on CLISP which is free and has extensive coding support.
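The "file options" referred to throughout are Emacs-style declarations such as `;;; -*- coding: utf-8 -*-` on the first line of a file. A minimal sketch of how such an option might be read (a hypothetical function, not the actual asdf.lisp code; BOM handling, second-line options, and non-ASCII-compatible encodings are omitted):

```lisp
(defun read-coding-file-option (pathname)
  "Return the value of a 'coding:' file option on the first line of
PATHNAME as a keyword, or NIL if none is found.  Sketch only: assumes
an ASCII-compatible encoding for the first line."
  (with-open-file (stream pathname)
    (let* ((line (or (read-line stream nil) ""))
           (start (search "-*-" line)))
      (when start
        (let ((end (search "-*-" line :start2 (+ start 3))))
          (when end
            ;; Text between the -*- markers, e.g. " coding: utf-8; mode: lisp "
            (let* ((options (subseq line (+ start 3) end))
                   (pos (search "coding:" options)))
              (when pos
                ;; Take the value up to the next ';', if any.
                (let* ((raw (subseq options (+ pos 7)))
                       (semi (position #\; raw)))
                  (intern (string-upcase (string-trim " " (subseq raw 0 semi)))
                          :keyword))))))))))
```

For a first line of `;;; -*- coding: utf-8 -*-` this returns :UTF-8, which would then be translated to an implementation external-format.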
Regards Douglas Crosher
On 4/18/12 7:24 AM, Douglas Crosher wrote:
- UTF-8 detection removed to save on the line count. Still detects a UTF-8 BOM, and reads a UTF-8 encoding file option. Adding a
UTF-8 encoding file option will help other tools too anyway.
Does anyone use a BOM with utf8? It's not required, doesn't do anything except consume three octets, but it's not disallowed either.
- Removed the lengthy external-format translation table to save on the line count. It should be easy for CL implementations to
include more aliases, so the burden of maintaining the aliases is moved to the CL implementation. CLISP and the Scieneer CL already support an extensive range of codes and aliases, and an updated set of aliases has been sent for CMUCL.
The updated set of aliases for CMUCL will be in the next snapshot. Thanks!
- Added a large set of test files that exercise the reading of the encoding file option and try to include enough characters from
each character set to check that the encoding option has been successfully mapped to an appropriate external-format. This includes all the encodings in Linux's 'iconv -l' that support the characters needed by CL, but excludes the EBCDIC codes. All 629 tests pass on the Scieneer CL, 628 on CLISP, and a much smaller but useful set on CCL, CMUCL, ECL, and SBCL due to their limited aliases and limited encoding support. See: http://www.scieneer.com/files/coding-tests.tar.bz2
Is there any "mandatory" set of encodings that asdf wants? Presumably that means at least iso8859-1 and utf8. Anything else is a bonus?
Ray
On 04/19/2012 09:58 AM, Raymond Toy wrote:
On 4/18/12 7:24 AM, Douglas Crosher wrote:
- UTF-8 detection removed to save on the line count. Still detects a UTF-8 BOM, and reads a UTF-8 encoding file option. Adding a
UTF-8 encoding file option will help other tools too anyway.
Does anyone use a BOM with utf8? It's not required, doesn't do anything except consume three octets, but it's not disallowed either.
Yes, there are files in Quicklisp releases with a UTF-8 BOM.
Note that these will cause an error when incorrectly loaded in an 8-bit locale - another counterexample for those who think UTF-8 Lisp source will load and run as expected in an 8-bit locale!
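For reference, the UTF-8 BOM is the three octets #xEF #xBB #xBF at the start of the file. A minimal detection sketch (illustrative only, not the code under discussion):

```lisp
(defun utf-8-bom-p (pathname)
  "Return true if PATHNAME begins with a UTF-8 byte order mark
(the octets #xEF #xBB #xBF)."
  (with-open-file (stream pathname :element-type '(unsigned-byte 8))
    (let ((bytes (make-array 3 :element-type '(unsigned-byte 8)
                               :initial-element 0)))
      ;; READ-SEQUENCE returns the number of octets actually read,
      ;; so short files are handled correctly.
      (and (= 3 (read-sequence bytes stream))
           (equalp bytes #(#xEF #xBB #xBF))))))
```

A loader that detects the BOM can then open the file as UTF-8 (and skip the mark) regardless of locale.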
- Removed the lengthy external-format translation table to save on the line count. It should be easy for CL implementations to
include more aliases, so the burden of maintaining the aliases is moved to the CL implementation. CLISP and the Scieneer CL already support an extensive range of codes and aliases, and an updated set of aliases has been sent for CMUCL.
The updated set of aliases for CMUCL will be in the next snapshot. Thanks!
- Added a large set of test files that exercise the reading of the encoding file option and try to include enough characters from
each character set to check that the encoding option has been successfully mapped to an appropriate external-format. This includes all the encodings in Linux's 'iconv -l' that support the characters needed by CL, but excludes the EBCDIC codes. All 629 tests pass on the Scieneer CL, 628 on CLISP, and a much smaller but useful set on CCL, CMUCL, ECL, and SBCL due to their limited aliases and limited encoding support. See: http://www.scieneer.com/files/coding-tests.tar.bz2
Is there any "mandatory" set of encodings that asdf wants? Presumably that means at least iso8859-1 and utf8. Anything else is a bonus?
I would like to see ASDF continue to work using just ASCII, and to support other encodings. Not every user wants to use it with Unicode projects. UTF-8 support is adequate if you are prepared to convert to it, and I suspect this will suit a lot of users.
Ray
asdf-devel mailing list asdf-devel@common-lisp.net http://lists.common-lisp.net/cgi-bin/mailman/listinfo/asdf-devel
On Wed, Apr 18, 2012 at 10:24, Douglas Crosher dtc-asdf@scieneer.com wrote:
A revised version of ASDF replacing the system definition :encoding support with support for reading the encoding from file options is now available at: http://www.scieneer.com/files/asdf.lisp
I thank you for your interest and your code, Douglas; however, unless you're willing to bear the curse of ASDF maintainership (which I hope you're not going to claim lightly, for I will yield it happily), I'd appreciate it if you tried to do things my way: for now, I'd like encoding autodetection to be done in the asdf-encodings system.
I understand this may be frustrating for you, especially since for some reason you truly hate the idea of an :encoding specification (though I'm not sure I understand why). However, keeping the encodings support separate, even temporarily, has several advantages:
* It allows this particular fast-moving code to evolve and be refined without burdening ASDF, and without having to cast in stone design choices made before we fully understand the issues.
* It keeps ASDF small for most people, yet allows the extension code to grow big. For instance, not only can the extension itself have many long files and depend on many libraries, it is conceivable that a far future version of asdf-encodings could translate on the fly between encodings not supported by an implementation and UTF-8 as supported by the implementation (this would require additional hooks in ASDF).
* It fits the minimalist design goal of ASDF, whereby ASDF tries to contain only the bare minimum needed to build Lisp software, together with enough extension hooks so that desired features can be built on top. I believe this has been a relative success so far, with extensions including CFFI-GROVEL, HU.DWIM.ASDF, POIU, ASDF-DEPENDENCY-GROVEL, plus various things used at ITA that I know of, and possibly more used in other places that I don't.
And yes, if and when we reach some stable, widely accepted design and implementation in asdf-encodings, it will be time to consider integrating it into ASDF itself. However, unlike the case of the source-registry and the output-translations layers, where bootstrapping was an issue that made keeping those layers separate a self-defeating non-starter, I believe we can afford (at least for now) to require people who need non-standard encodings to load asdf-encodings separately and to merge that layer at a later date (if ever).
As for the specific code you propose:
* I asked on #emacs for pointers to how Emacs identifies coding. I documented the results in comments in asdf-encodings. The Emacs way differs from your code in various ways. If we are going that way, is there any reason not to "just adopt" the Emacs code?
* Does it make sense for a file to have a UTF-16LE header that specifies coding: koi8-r? I don't think so. Or a pun file that in UTF-16BE says it's UTF-16LE, and the other way around (or a longer circuit)? I think your algorithm tries both too hard (as in this case) and too little (as in cases where Emacs finds a coding and your code doesn't).
* All in all, that doesn't mean your code is bad, but it probably means we should experiment with it and tweak it before we declare ourselves satisfied with burning it into ASDF (which is somewhat less easy to upgrade than a casual library).
—♯ƒ I'm a polyatheist — there are many gods I don't believe in. — Dan Fouts
On 04/19/2012 10:12 AM, Faré wrote:
On Wed, Apr 18, 2012 at 10:24, Douglas Crosher dtc-asdf@scieneer.com wrote:
A revised version of ASDF replacing the system definition :encoding support with support for reading the encoding from file options is now available at: http://www.scieneer.com/files/asdf.lisp
...
I understand this may be frustrating for you, especially since for some reason you truly hate the idea of a :encoding specification (though I'm not sure I understand why).
Sure, let's keep the encoding code out of ASDF - after considering your points below it does seem clear that it is best to keep this out of ASDF. I was trying to address the need for a bundled solution, but this does not appear necessary. However, I would ask you to reconsider the impact of releasing ASDF with the :encoding declaration bundled and of recommending its use.
If ASDF is released with the :encoding system definition declaration, and this is further offered as the only solution to authors of portable UTF-8 code, then some will no doubt start using it, and :encoding declarations will end up in system definition files for tools to deal with and to be supported in future.
The encoding file option seems to do a better job of solving the problem and is usable by other code. It solves the problem of the system definition file's own encoding, which the :encoding declaration can't. It can be used by the automated recoding tools that are badly needed - there is a path for Quicklisp to automatically recode projects to suit the CL implementation. It can be used by CL implementations for 'load and 'compile-file, and by editors. This seems like the best path for solving the problem.
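The kind of recoding tool described here could be sketched as follows (a hypothetical function; external-format designators vary by implementation, and updating the coding: option line itself is omitted):

```lisp
(defun recode-file (pathname from &key (to :utf-8))
  "Rewrite PATHNAME, decoding it with external-format FROM and
re-encoding it with external-format TO.  Sketch only: reads the whole
file into memory and overwrites it in place."
  (let ((text (with-open-file (in pathname :external-format from)
                ;; FILE-LENGTH may over-estimate the character count
                ;; for multi-octet encodings; trim to what was read.
                (let ((buffer (make-string (file-length in))))
                  (subseq buffer 0 (read-sequence buffer in))))))
    (with-open-file (out pathname :direction :output
                                  :if-exists :supersede
                                  :external-format to)
      (write-string text out))))
```

Run on CLISP or another implementation with broad encoding support, this is the sort of building block a Quicklisp recoding pass could use.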
Once the encoding file option is implemented, the :encoding declaration would seem to be just a liability. Code that already uses the :encoding declaration will not assist other tools that look for the encoding file option. For example, if there are some Quicklisp projects using an :encoding option, then recoding their source becomes more problematic, or the tools much more complex. Further, there is the problem of what to do in future if there is a conflict between the :encoding declaration and an encoding file option. What if someone recodes files, adding a coding file option, but does not track down and update the :encoding declarations in scope?
For these reasons I think the :encoding declaration is a dead end and a liability, and that it should not be released, and that you should not be encouraging its use.
I suggest that the encoding file option is a good plan, and that this be communicated to users, and that the social solution of having everyone use UTF-8 be toned down.
However, keeping the encodings support separate, even temporarily, has several advantages:
Sure, it's a chunk of code that does not seem to really belong in ASDF.
Are you sure that Quicklisp does not need any support bundled? Perhaps it can bootstrap from just asdf.lisp, keep itself ASCII clean, build itself, then download and install asdf-encodings which could be ASCII clean too. Sounds like a good plan if it can all work.
I was working on a bare minimum as an alternative to the :encoding declaration; there are hooks for the rest. Perhaps even this is unnecessary, in which case please do keep it all out of ASDF. But please also consider removing the :encoding support too, as it does seem like a liability - just keep the hooks and default to :default. Sounds like a good plan.
Regarding the hooks, might it be better for them to be lists of functions to call in turn until successful, so that multiple projects can add hooks and still work together?
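The hook style proposed here might look like this (hypothetical names and protocol, not the current ASDF API):

```lisp
(defvar *encoding-detection-hooks* '()
  "A list of functions of one argument (a pathname).  Each returns an
external-format designator, or NIL to decline so the next hook is tried.")

(defun detect-encoding (pathname &optional (default :default))
  "Call each hook in turn until one returns a non-NIL external-format;
fall back to DEFAULT if none does."
  (or (some (lambda (hook) (funcall hook pathname))
            *encoding-detection-hooks*)
      default))

;; Multiple projects can then cooperate without clobbering each other:
;;   (pushnew 'my-project::coding-option-hook *encoding-detection-hooks*)
```

Each project only answers for the files it recognizes, which is what lets independently loaded extensions still work together.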
- it allows this particular fast moving code to evolve and be refined
without burdening asdf, and without having to cast in stone design choices made before we fully understand the issues.
The same could be said for the :encoding declaration. You may regret releasing it: having to maintain it, to deal with conflicts with future file option solutions, to deal with authors who keep using it, and with tools that don't work with it!
- it keeps ASDF small for most people, yet allows the extension code
to grow big.
Agreed, and I have been trying to strip it down to a bare minimum that could be bundled, and even this is in a hook that could be replaced when asdf-encodings is loaded.
For instance, not only can the extension itself can have many long files and depend on many libraries, it is conceivable that a far future version of asdf-encodings could translate on the fly between encodings not supported by an implementation and utf-8 as supported by the implementation (this would require additional hooks in ASDF).
Great, I am glad you can see a path to support non-UTF-8 too.
- It fits the minimalist design goal of ASDF, whereby
ASDF tries to contain only the bare minimal to build Lisp software, together with enough extension hooks so that desired features can be built on top. I believe this has been a relative success so far, with extensions including CFFI-GROVEL, HU.DWIM.ASDF, POIU, ASDF-DEPENDENCY-GROVEL, plus various things used at ITA that I know, and possibly more used other places that I don't.
And yes, if and when we reach some stable, widely accepted design and implementation in asdf-encodings, it will be time to consider integrating it into ASDF itself. However, unlike the case of the source-registry and the output-translations layers, where bootstrapping was an issue that made keeping those layers separate a self-defeating non-starter, I believe we can afford (at least for now) to require people who need non-standard encodings to load asdf-encodings separately and to merge that layer at a later date (if ever).
As for the specific code you propose,
- I asked on #emacs pointers to how Emacs identifies coding.
I documented the results in comments in asdf-encodings. The Emacs way differs from your code in various ways. If we are going that way, is there any reason not to "just adopt" the Emacs code?
It's written in C, is a big chunk of code, and puts a lot more weight on auto-detection. My code does the bare minimum to read the file options, and it has been tested on every encoding supported by my system that also supports the characters needed for CL code (EBCDIC excluded, but all of these also work with another 40 lines of code).
- Does it make sense for a file to have a UTF-16LE header that
specifies coding: koi8-r ? I don't think so.
Yes, it is inconsistent, but it may be better to pass on the file option anyway so the error is detected. There are only a few cases that are detected from the BOM. Keep in mind that a BOM can be added when not appropriate and most decoders will just ignore it and keep working, so reading and returning the file option seems the best path.
Or a pun file that in UTF-16BE says it's UTF-16LE, and the other way around (or a longer circuit)? I think your algorithm tries both too hard (as in this case), and too little (as in cases where Emacs finds a coding and your code doesn't).
I am not aware of any cases where my code fails to read the file options, and there is a big set of tests available to confirm this.
Keep in mind that detection is not 100% reliable, and there are often multiple encodings that match a file. One concern is that if people start trusting auto-detection and not adding a file option then the mechanism becomes less reliable - another tool may not have the same detection algorithm or may make different assumptions.
Reading the file option is reliable, and would likely remain the first thing to check.
- All in all that doesn't mean your code is bad,
but that probably means we should experiment with it and tweak it, before we declare ourselves satisfied with burning it into ASDF (which is somewhat less easy to upgrade than a casual library).
I am not suggesting releasing the code, just making the progress available. The reading of the file options has been well tested though. Other areas needing work are the translations of the external-formats for each CL implementation, and compatibility with the Emacs codings.
—♯ƒ I'm a polyatheist — there are many gods I don't believe in. — Dan Fouts
: Douglas Crosher
I would ask you to reconsider the impact of releasing ASDF with the :encoding declaration bundled and of recommending its use.
I'd like to add :encoding now, if only because if we are to ever use it, we must wait for it to be in an asdf that has already made its way to all/most implementations before it's considered universal enough for libraries to rely on it. Whereas if we don't end up using it, it's easier to remove later.
But you're right: I should not actively encourage its use for now, except for people who know what they are doing and are ready to deal with stricter dependencies and possible future change.
As an example use of asdf with encodings support, I'm using asdf-encodings with lambda-reader, which I just hacked to also work in 8-bit mode, with a system to load it in UTF-8, and another to load it in Latin1 (for testing purposes on SBCL). Punning the same file under two encodings is one reason why I like the :encoding feature. (As a bonus, the lambda-reader-8bit.asd includes code to define a file that gets loaded but not compiled, a feature asked for several times on this mailing list.)
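For illustration, the :encoding feature under discussion looks like this in a system definition (the system and file names here are made up):

```lisp
;; A sketch of a defsystem form using the :encoding option, at both
;; the system level and per component:
(asdf:defsystem :lambda-reader-example
  :encoding :utf-8                 ; default for every file in the system
  :components
  ((:file "reader")                ; inherits :utf-8
   (:file "latin1-tables"
    :encoding :latin1)))           ; per-component override
```

The same source tree can then be loaded under a different encoding simply by defining a second system with a different :encoding value.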
If ASDF is released with the :encoding system definition declaration and further offered as the only solution to authors of portable UTF-8 code then some will no doubt start using this and the :encoding declarations will be in system definition files for tools to deal with and to be supported in future.
That's one of the reasons I wanted UTF-8 to be a default that didn't require any :encoding specification. But if it's going to break things for users, then it's not ideal either. And at least 6 libraries in Quicklisp haven't given me feedback when I warned about this issue. I will give them a few months of delay before I change the default and consider breaking them OK because they're unsupported.
The encoding file option […]
I suppose you mean autodetection based on file contents including any Emacs-style declaration.
Indeed it might be the solution... but it forces every non-ASCII file to have a header, unless we have a default like UTF-8. Also, autodetection adds quite a few hundreds of lines to ASDF, especially if we want to do it right, i.e. in a way fully compatible with Emacs.
Also, the cost of the component :encoding feature is ten to twenty lines of code more than the cost of a mere encoding-autodetection hook (also a few tens of lines), which frankly is tiny compared to full autodetection support.
It solves the problem of the system definition file encoding, which the :encoding declaration can't. It can be used by automated recoding tools that are badly needed - there is a path for Quicklisp to automatically recode projects to suit the CL implementation. It can be used by CL implementations for 'load and 'compile-file, and by editors. This seems like the best path for solving the problem.
I like this approach, but it requires more third-party coding. One of my principles in ASDF is to enable people to do things without having to wait on other people, i.e. to make coupling looser. The :encoding feature reduces coupling by not having to wait for quality autodetection and Quicklisp support.
Once the encoding file option is implemented, the :encoding declaration would seem to just be a liability.
It also provides a slight performance boost by not requiring autodetection. And even Emacs gives precedence to filename-based encoding detection. Additionally, it's much simpler to support and the code to support it is already there.
Code in asdf-encodings doesn't support encoding autodetection yet. I'm sorry, but I think that the code you and pjb posted, while a very good start, is not a 100% solution. A 100% solution would be 100% Emacs-compatible.
Code that already uses the :encoding declaration will not assist other tools that look for the encoding file option.
It provides an API for querying the encoding used by a component.
For example if there are some Quicklisp projects using an :encoding option then recoding their source becomes more problematic or the tools much more complex.
Why would you want to do that? And if you do, is it that difficult editing the .asd file? Or is your problem that .asd files aren't declarative enough? In the latter case, we agree; but then you're welcome to help with XCVB.
Further there is the problem of what to do in future if there is a conflict between the :encoding declaration and an encoding file option.
Just like in Emacs: the external declaration or manual setting takes precedence.
What if someone does recode files and this adds a coding file option but does not track down and update :encoding declarations in scope?
That's a bug. And what if someone writes a wrong coding declaration, or keeps an old declaration after recoding? That's a bug, too. I don't see this as a danger looming on developers; especially since providing a deterministic compilation behavior means that developers will detect such bugs early enough and portably, as opposed to the "it works for me but not for you" hell of the current default behavior. I see an evolving autodetection algorithm as more version hell, and a non-evolving autodetection algorithm as a probable buggy hurdle. Both enabling developers and making them responsible for their choices while giving them predictable deterministic feedback: I call that progress.
For these reasons I think the :encoding declaration is a dead end and a liability, and that it should not be released, and that you should not be encouraging its use.
Sorry, I'm not convinced.
I suggest that the encoding file option is a good plan, and that this be communicated to users, and that the social solution of having everyone use UTF-8 be toned down.
Even with autodetection, we should still encourage UTF-8, since it's one of the few encodings that is universally supported on all modern Lisp implementations. For instance, Lispworks on Unix supports very few encodings. If you want your code to be maximally portable, please use UTF-8.
However, keeping the encodings support separate, even temporarily, has several advantages:
Sure, it's a chunk of code that does not seem to really belong in ASDF.
Thanks for agreeing on that.
Are you sure that Quicklisp does not need any support bundled? Perhaps it can bootstrap from just asdf.lisp, keep itself ASCII clean, build itself, then download and install asdf-encodings which could be ASCII clean too. Sounds like a good plan if it can all work.
As long as we don't depend on encoding for .asd files themselves, then it's better for the systems that depend on non-UTF8 encodings to depend on asdf-encodings.
As for the encoding of .asd files, I propose that we should standardize on UTF-8 (US-ASCII being an acceptable subset of it).
Regarding the hooks, might it be better for them to be lists of functions to call in turn until successful, so that multiple projects can add hooks and still work together?
Sorry, I don't see how this could possibly work. I'd rather have an API that makes it clear that someone must be in charge, than an API that makes a mush of responsibility and is an invitation to catastrophic interactions.
- it allows this particular fast moving code to evolve and be refined
without burdening asdf, and without having to cast in stone design choices made before we fully understand the issues.
The same could be said for the :encoding declaration. You may regret releasing it: having to maintain it, to deal with conflicts with future file option solutions, to deal with authors who keep using it, and with tools that don't work with it!
I understand that, but that's a risk I'm willing to take as ASDF maintainer. It's less than twenty lines of code, and only active package maintainers are going to use this feature in the next few months, so that if I change my mind before next year, I expect that I'll be able to back off if I really want, and have those same active maintainers follow me (though of course not without my deservedly losing their future good will).
- it keeps ASDF small for most people, yet allows the extension code
to grow big.
Agreed, and I have been trying to strip it down to a bare minimum that could be bundled, and even this is in a hook that could be replaced when asdf-encodings is loaded.
I believe that "always detect the default" is such a bare minimum, with asdf:*default-encoding* currently being :default, but hopefully :utf-8 in the future (future ouch: when we change it, will that be a defparameter or a defvar?)
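The parenthetical matters because the two defining forms behave differently when the default changes across an upgrade; an illustrative sketch (the second variable name is made up for contrast):

```lisp
;; DEFVAR only initializes an unbound variable, so re-loading a new
;; version of the file preserves a user's explicit setting:
(defvar *default-encoding* :default)
(setf *default-encoding* :latin1)     ; user's choice
(defvar *default-encoding* :utf-8)    ; "upgrade": value stays :latin1

;; DEFPARAMETER unconditionally re-assigns, so the upgrade wins and
;; the user's choice is silently discarded:
(defparameter *other-default* :default)
(setf *other-default* :latin1)        ; user's choice
(defparameter *other-default* :utf-8) ; "upgrade": value is now :utf-8
```

So the choice decides whether changing the shipped default to :utf-8 overrides configurations users have already made.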
As for the specific code you propose,
- I asked on #emacs pointers to how Emacs identifies coding.
I documented the results in comments in asdf-encodings. The Emacs way differs from your code in various ways. If we are going that way, is there any reason not to "just adopt" the Emacs code?
It's written in C, is a big chunk of code, and puts a lot more weight on auto-detection. My code does the bare minimum to read the file options, and it has been tested on every encoding supported by my system that also supports the characters needed for CL code (EBCDIC excluded, but all of these also work with another 40 lines of code).
I suppose I should commit something based on your code for now. But that raises the question: won't we want to "improve" the algorithm to make it more like Emacs? If the algorithm changes, won't that create "interesting" side effects and changes in behavior that bite someone in the back?
- Does it make sense for a file to have a UTF-16LE header that
specifies coding: koi8-r ? I don't think so.
Yes, it is inconsistent, but it may be better to pass on the file option anyway so the error is detected. There are only a few cases that are detected from the BOM. Keep in mind that a BOM can be added when not appropriate and most decoders will just ignore it and keep working, so reading and returning the file option seems the best path.
Or a pun file that in UTF-16BE says it's UTF-16LE, and the other way around (or a longer circuit)? I think your algorithm tries both too hard (as in this case), and too little (as in cases where Emacs finds a coding and your code doesn't).
I am not aware of any cases where my code fails to read the file options, and there is a big set of tests available to confirm this.
Keep in mind that detection is not 100% reliable, and there are often multiple encodings that match a file. One concern is that if people start trusting auto-detection and not adding a file option then the mechanism becomes less reliable
- another tool may not have the same detection algorithm
or make different assumptions.
Exactly the reason why I only like detection so much, with or without declaration.
Reading the file option is reliable, and would likely remain the first thing to check.
- All in all that doesn't mean your code is bad,
but that probably means we should experiment with it and tweak it, before we declare ourselves satisfied with burning it into ASDF (which is somewhat less easy to upgrade than a casual library).
I am not suggesting releasing the code, just making the progress available. The reading of the file options has been well tested though. Other areas needing work are the translations of the external-formats for each CL implementation, and compatibility with the Emacs codings.
OK. Well, at this point I'm accepting patches to asdf-encodings.
—♯ƒ • François-René ÐVB Rideau •Reflection&Cybernethics• http://fare.tunes.org Malthus was right. It's hard to see how the solar system could support much more than 10^28 people or the universe more than 10^50. — John McCarthy
On 22 Apr 2012, at 00:01, Faré wrote:
: Douglas Crosher
I would ask you to reconsider the impact of releasing ASDF with the :encoding declaration bundled and of recommending its use.
I'd like to add :encoding now, if only because if we are to ever use it, we must wait for it to be in an asdf that has already made its way to all/most implementations before it's considered universal enough for libraries to rely on it. Whereas if we don't end up using it, it's easier to remove later.
But you're right: I should not actively encourage its use for now, except for people who know what they are doing and are ready to deal with stricter dependencies and possible future change.
These two paragraphs show very clearly what's wrong with ASDF (since v2.x): experimental features are pushed to CL implementations (which mostly pick them up because ASDF has turned into a de-facto standard), instead of figuring out a way that helps the CL community have a say on what the best solutions for particular problems could be. The decision of which concrete experimental feature is put into ASDF is largely in the hands of the maintainers.
People who know what they are doing can _always_ go to some place and fetch the concrete experimental feature that they need. Those people _don't need_ a feature to be available in all CL implementations. The only features that should be in all CL implementations are the ones that are stable and mature!
The situation before ASDF 2.x, when ASDF was not bundled with CL implementations, was much better!
Pascal
-- Pascal Costanza The views expressed in this email are my own, and not those of my employer.
On Sun, Apr 22, 2012 at 08:48, Pascal Costanza pc@p-cos.net wrote:
On 22 Apr 2012, at 00:01, Faré wrote:
I'd like to add :encoding now, if only because if we are to ever use it, we must wait for it to be in an asdf that has already made its way to all/most implementations before it's considered universal enough for libraries to rely on it. Whereas if we don't end up using it, it's easier to remove later.
But you're right: I should not actively encourage its use for now, except for people who know what they are doing and are ready to deal with stricter dependencies and possible future change.
These two paragraphs show very clearly what's wrong with ASDF (since v2.x): experimental features are pushed to CL implementations (which mostly pick them up because ASDF has turned into a de-facto standard), instead of figuring out a way that helps the CL community have a say on what the best solutions for particular problems could be. The decision of which concrete experimental feature is put into ASDF is largely in the hands of the maintainers.
People who know what they are doing can _always_ go somewhere and fetch the concrete experimental feature that they need. Those people _don't need_ a feature to be available in all CL implementations. The only features that should be in all CL implementations are the ones that are stable and mature!
Unlike the support for multiple encodings and their detection, which is indeed experimental and lives in the extension system asdf-encodings, the hooks and defaults for encodings in ASDF itself are mature and stable. They add all of 68 lines to ASDF out of a current count of 4422 (versus 2011 lines before I took over), of which explicit :encoding versus implicit auto-detection accounts for only about 20. I don't think that's an excessive price.
This is a case of ASDF following the principle "Keep things as simple as you can, but no simpler." Encoding support is one of those features that, in the absence of proper hooks, cannot be added after the fact without either defeating the purpose or requiring wholesale replacement of ASDF internals. The same goes for upgradability, search-path configuration, output translations, around-compile capability, Windows .lnk support, a data model compatible with a parallelizing extension (POIU), ECL compiler-mode support (native vs. bytecode compiler), logical-pathname support, etc.
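For illustration (this sketch is mine, not from the thread; the system and file names are hypothetical), the :encoding feature under discussion amounts to a per-system declaration in the .asd file:

```lisp
;;; my-library.asd -- hypothetical example of the :encoding option
(defsystem "my-library"
  :encoding :utf-8   ; applies to every component unless overridden per component
  :components ((:file "unicode-strings")))
```

A project-wide default can also be set before loading systems; as I recall the ASDF 2.21-era sources, the variable is asdf:*default-encoding*, e.g. (setf asdf:*default-encoding* :utf-8) -- treat the exact name as an assumption and check your ASDF version.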
The situation before ASDF 2.x, when ASDF was not bundled with CL implementations, was much better!
I don't see how. ASDF 2 is fully backwards compatible in this regard, and you can keep loading a manually installed ASDF rather than (require "asdf") if that's what you want, including any old ASDF 1 you used to love. Actually, UNLIKE what was the case with ASDF 1, you can load ASDF 2 on top of a previously loaded or dumped ASDF without breaking everything (I remember sorry experiences trying hard to avoid images dumped by common-lisp-controller). Thus you can use the bundled ASDF to load your own version, and that's what we now do at ITA for better version control.
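Concretely, the bootstrap described above is just (the path here is a placeholder for wherever you keep your pinned copy):

```lisp
;; Load the implementation-bundled ASDF first, then your own copy on top of it;
;; with ASDF 2 this upgrade-in-place is supported and does not break the image.
(require "asdf")
(load "/path/to/your/asdf.lisp")
```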
Now if you have great ideas on how to enhance ASDF, we accept patches, and I've announced many times that I was ready to retire should someone else come forward as a maintainer.
—♯ƒ • François-René ÐVB Rideau • Reflection & Cybernethics • http://fare.tunes.org
The main reason to always choose the simplest explanation is that it leaves the least leeway for parasites to manipulate you. If you don't pick the simplest explanation, you're being manipulated.
On 04/23/2012 01:36 AM, Faré wrote:
On Sun, Apr 22, 2012 at 08:48, Pascal Costanza pc@p-cos.net wrote:
On 22 Apr 2012, at 00:01, Faré wrote:
I'd like to add :encoding now, if only because, if we are ever to use it, we must wait for it to be in an ASDF that has already made its way to all or most implementations before it can be considered universal enough for libraries to rely on. Whereas if we don't end up using it, it's easier to remove later.
But you're right: I should not actively encourage its use for now, except for people who know what they are doing and are ready to deal with stricter dependencies and possible future changes.
These two paragraphs show very clearly what's wrong with ASDF (since v2.x): experimental features are pushed into CL implementations (which mostly pick them up because ASDF has turned into a de facto standard), instead of figuring out a way that helps the CL community have a say in what the best solutions for particular problems could be. The decision about which concrete experimental features are put into ASDF is largely in the hands of the maintainers.
People who know what they are doing can _always_ go somewhere and fetch the concrete experimental feature that they need. Those people _don't need_ a feature to be available in all CL implementations. The only features that should be in all CL implementations are the ones that are stable and mature!
Unlike the support for multiple encodings and their detection, which is indeed experimental and lives in the extension system asdf-encodings, the hooks and defaults for encodings in ASDF itself are mature and stable. They add all of 68 lines to ASDF out of a current count of 4422 (versus 2011 lines before I took over), of which explicit :encoding versus implicit auto-detection accounts for only about 20. I don't think that's an excessive price.
My main concern was the liability, but after developing the support tools further it no longer seems such a big deal to have the :encoding declaration released now. I thought the tools would need to update the :encoding declarations in the .asd files, which would be very difficult, but the simple solution is to just remove them when doing any recoding. Quicklisp could remove all the :encoding declarations from releases as the coding file options are updated or inserted, or perhaps do some consistency checking between the file encoding and the declared :encoding after loading systems. I do think the :encoding declaration is unnecessary, but tools can deal with it easily. The only issue may be an author using the :encoding declaration to read octets from an encoded file as ISO-8859-1 characters - but who would do that!
Regards,
Douglas Crosher
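The file coding options Douglas refers to are the Emacs-style -*- ... -*- lines near the top of a source file. A consistency check of the kind he suggests for Quicklisp could start from something like the sketch below (my own illustration, not Douglas's actual tool; the function and pattern names are mine):

```python
import re

# Matches an Emacs-style file-options line, e.g.
#   ;;; -*- mode: lisp; coding: utf-8 -*-
# and extracts the declared coding, if any.
OPTIONS_LINE = re.compile(r"-\*-\s*(.*?)\s*-\*-")
CODING_OPTION = re.compile(r"coding:\s*([-\w.]+)", re.IGNORECASE)

def declared_coding(text):
    """Return the coding named in a file-options line within the first
    two lines of TEXT (shebang lines push it to line two), or None."""
    for line in text.splitlines()[:2]:
        m = OPTIONS_LINE.search(line)
        if m:
            c = CODING_OPTION.search(m.group(1))
            if c:
                return c.group(1).lower()
    return None

print(declared_coding(";;; -*- mode: lisp; coding: utf-8 -*-\n(defun f ())"))
# -> utf-8
print(declared_coding("(defun g ())"))
# -> None
```

A real checker would then decode the file using the declared coding and compare the result against the system's :encoding declaration, flagging any mismatch.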