Where do empty compilation units belong? (original) (raw)

Alex Buckley alex.buckley at oracle.com
Mon Nov 26 21:44:01 UTC 2018


// Adding compiler-dev since the parsing of files into compilation units is not a Jigsaw issue.

On 11/20/2018 9:14 PM, Jayaprakash Artanareeswaran wrote:

"jigsaw-dev" <jigsaw-dev-bounces at openjdk.java.net> wrote on 21/11/2018 01:56:42 AM: > Jon points out that OrdinaryCompilationUnit will match an empty stream > of tokens (I dislike the syntax-driven optionality here, but it's > longstanding) so the file D.java could be regarded as a compilation unit > with no package declaration, no import declarations, and no type > declarations. > > Per JLS 7.4.2, such a compilation unit is in an unnamed package, and > must be associated with an unnamed module. > > I would prefer 7.4.2 to say only that a compilation unit with no package > declarations and at least one type declaration is in an unnamed > package (and must be associated with an unnamed module; 7.3 should > enumerate that possibility). A compilation unit with no package > declarations and no type declarations would be deemed unobservable by > 7.3, and all these questions about what to do with empty files would > disappear.

That would be perfect and make things unambiguous. But for now, the paragraph above is good enough for me.

Unfortunately, import declarations can have side effects (compile-time errors) so to be sure that the "no package or type decl === unobservable" rule is suitable for a file containing just an import decl, we would have to do a case analysis of how javac and ecj handle the eight combinations of the three parts allowed in an ordinary compilation unit. That's overkill for the situation involving empty files that keeps coming up and that I really want to clarify. I don't think anyone loves that an ordinary compilation unit matches the empty stream, so let's define away that scenario. As Jon said, an empty file doesn't present anything to be checked; there is no compilation unit there, so let's be unambiguous about that.

We can rule out the empty stream in 7.3 with grammar or with semantics. Usually a semantic description is clearest (gives everyone the proper terminology and concepts) but in this case we don't want the description to wrestle with "consists of one, two, or three parts" when the grammar allows zero. So, a new grammatical description is appropriate, and straightforward:

OrdinaryCompilationUnit: PackageDeclaration {ImportDeclaration} {TypeDeclaration} ImportDeclaration {ImportDeclaration} {TypeDeclaration} TypeDeclaration {TypeDeclaration}

The "three parts, each of which is optional" description is still accurate. The package decl part is optional (as long as you have the import decls part and/or the type decls part); the import decls part is optional (as long as you have either the package decl part or ...) ... you get the picture.

I would leave 7.4.2 alone; an ordinary compilation unit with no package or type decls but with import decls is part of the unnamed package (and thus unnamed module) as before, and compilers can handle that, I think.

Any comments?

Alex



More information about the compiler-dev mailing list