nimforum mirror - Nimble package structure and interop changes

shashlick [2020-08-28T04:20:14+02:00]

Inviting review and feedback from the community on some Nimble changes that are being planned for the near future. The problem statement and changes are documented on the wiki so as to retain change history.

Link: Package structure and Interop

The goal:

Simplify nimble package creation for users

Retain the original package repository structure

Simplify nimble => nim interop

Maintain backwards compatibility

Please share feedback right here since the wiki doesn't support commenting.

Thanks in advance.

wltsmrz [2020-08-28T09:01:23+02:00]

Have you looked into using github package registry?

Araq [2020-08-28T12:04:13+02:00]

I still don't understand how backwards compatibility would be handled. By a nimble --legacy switch?

cblake [2020-08-28T13:06:34+02:00]

Rather than "warning that exporting multiple packages is imprudent", the default should just be a similar reporting & resolution mechanism for name management of symbols/procs in modules. Specifically, if both pkgA/modA and pkgB/modA (or pkgC...) exist then the compiler tells you to qualify the import to pick the one you want (or warns you). The qualification just uses a slash (/) rather than a period (.).

Like symbols in modules, qualification should also always be allowed for those who want to be explicit about their dependencies. Just as you can always call strutils.find you can always say import pkgA/modA even if there is no ambiguity. This seems like the internally logically consistent name management way to go.

If you really want a nimble warning -- when no installed packages are known to have a conflicting top-level module name -- it could say "only module pkgA is protected from a need to package-qualify". (It is really only protected via the centralized package database not having duplicate package names, but people can always have some private code with --path causing a collision. So, it's not really ever "guaranteed".)
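A minimal sketch of the optional-qualification rule described above, written in Python purely for illustration. It is not how the Nim compiler or Nimble resolves imports; the function name resolve_import and the installed mapping are hypothetical.

```python
from typing import Dict, List


def resolve_import(spec: str, installed: Dict[str, List[str]]) -> str:
    """Resolve 'modA' or 'pkgA/modA' to a package-qualified module name.

    `installed` is a hypothetical map from package name to the module
    names that package exports at its top level.
    """
    if "/" in spec:
        # Explicit qualification is always allowed, ambiguous or not.
        pkg, mod = spec.split("/", 1)
        if mod not in installed.get(pkg, []):
            raise LookupError(f"module {mod!r} not found in package {pkg!r}")
        return f"{pkg}/{mod}"

    # Unqualified import: fine as long as exactly one package provides it.
    owners = sorted(pkg for pkg, mods in installed.items() if spec in mods)
    if not owners:
        raise LookupError(f"module {spec!r} not found")
    if len(owners) > 1:
        choices = ", ".join(f"{pkg}/{spec}" for pkg in owners)
        raise LookupError(f"ambiguous import {spec!r}; qualify as one of: {choices}")
    return f"{owners[0]}/{spec}"
```

With pkgA exporting modA and util, and pkgB exporting modA, importing util resolves without qualification, while importing modA asks the user to pick pkgA/modA or pkgB/modA.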

Araq [2020-08-28T14:04:38+02:00]

Well, --path has a strict order, so which module you refer to is deterministic. We could warn about ambiguous module imports, but there is little need: if the order is wrong, you import the wrong module and you get "symbol not found" errors or type mismatches.
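The first-match behaviour can be modelled in a few lines of Python. This is only a sketch of the rule "the earliest search path that contains the module wins"; it is not the compiler's actual lookup code, and find_module is a hypothetical name.

```python
from pathlib import Path
from typing import Iterable, Optional


def find_module(name: str, search_paths: Iterable[Path]) -> Optional[Path]:
    """Return the first '<name>.nim' found, scanning paths in order."""
    for directory in search_paths:
        candidate = Path(directory) / f"{name}.nim"
        if candidate.is_file():
            # Later directories are never consulted once a match is found,
            # so the result depends only on the order of search_paths.
            return candidate
    return None
```

Swapping two directories that both contain modA.nim changes which file is returned, which is the silent "wrong module" case mentioned above.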

zahary [2020-08-28T14:15:57+02:00]

Please be aware that @bobeff's work on reforming Nimble to support lockfiles addresses some of the presented issues and it's already quite close to completion. Please take your time to study it by examining the test cases. @bobeff is also likely to start working on better documentation quite soon.

I'll try to briefly describe the main ideas in the workflow with lockfiles and then I'll comment on each point in the wiki:

The lock file is a json file that precisely specifies the version of each transitive dependency of your project. It includes information about git revisions and directory content checksums, so the dependencies can be obtained both through git cloning and through tarball downloading. The integrity of all downloaded files is verified through the checksums before the package is installed in the global Nimble cache to prevent package hijacking attacks.
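To make the integrity check concrete, here is a rough Python sketch of verifying a downloaded package directory against a checksum recorded in a lock file. The lock layout (a "packages" map with a "checksum" field), the use of SHA-256, and both function names are assumptions made for illustration; the actual Nimble lock file format and hashing scheme may differ.

```python
import hashlib
from pathlib import Path


def directory_checksum(root: Path) -> str:
    """Hash relative file names and contents in a stable order."""
    digest = hashlib.sha256()
    files = [p for p in root.rglob("*") if p.is_file()]
    for path in sorted(files, key=lambda p: p.relative_to(root).as_posix()):
        rel = path.relative_to(root).as_posix().encode("utf-8")
        data = path.read_bytes()
        # Length prefixes keep name/content boundaries unambiguous.
        digest.update(len(rel).to_bytes(8, "big") + rel)
        digest.update(len(data).to_bytes(8, "big") + data)
    return digest.hexdigest()


def verify_package(name: str, lock: dict, downloaded: Path) -> None:
    """Raise if the downloaded directory does not match the lock entry.

    `lock` is the parsed JSON lock file; its shape here is hypothetical.
    """
    expected = lock["packages"][name]["checksum"]
    actual = directory_checksum(downloaded)
    if actual != expected:
        raise ValueError(f"{name}: checksum mismatch, refusing to install")
```

Because the checksum covers directory contents rather than a git object, the same check works whether the package arrived via a git clone or a tarball.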

Within the Nimble cache, it's possible to have many versions of the same package installed side-by-side. Within your project, Nimble generates a config file with the precise active paths at any moment. This file is not intended to be committed to git, but it will usually be included from your project config file, which is stored in git.

Thus, the typical workflow is to invoke Nimble only when something about your dependencies changed while normal builds and IDE plugins continue to use plain Nim, which results in the correct paths being sourced from the config files.

An important new feature is the localized develop mode. Within each project, you can use the nimble develop command to create a local nimble.develop file that includes path overrides for the dependencies that you are currently actively working on. This file is not intended to be committed to git. It can "include" other develop files to facilitate a common directory layout that is well suited to large teams. Here is an example showing what my workspace at Status would look like:


status/
  nim-libp2p/
    ...
  nim-chronicles/
    ...
  nim-eth/
    ...
  nimbus/
    ...
    nimbus.nimble
    nimbus.develop # includes ../status.develop
  nim-beacon-chain/
    ...
    beacon_chain.nimble
    beacon_chain.develop # includes ../status.develop
  ...
  status.develop # puts all of the Status packages in develop mode

In other words, the presence of a local develop file tells Nimble to prefer the local directories for certain packages instead of referring to the global cache while generating the config file with the active paths.
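The override logic can be sketched roughly as follows, in Python for illustration. The develop-file shape used here (an "includes" list plus an "overrides" map from package name to local directory), the function names, and the "<name>-<version>" cache layout are all assumptions; the real nimble.develop format is not specified in this thread.

```python
from typing import Dict, List, Optional, Set


def collect_overrides(
    name: str,
    files: Dict[str, dict],
    seen: Optional[Set[str]] = None,
) -> Dict[str, str]:
    """Merge overrides from a develop file and everything it includes."""
    seen = set() if seen is None else seen
    if name in seen:  # guard against include cycles
        return {}
    seen.add(name)
    current = files[name]
    merged: Dict[str, str] = {}
    for included in current.get("includes", []):
        merged.update(collect_overrides(included, files, seen))
    # Entries in the including file win over included ones.
    merged.update(current.get("overrides", {}))
    return merged


def active_paths(
    locked: Dict[str, str], overrides: Dict[str, str], cache: str
) -> List[str]:
    """Prefer a local develop directory, else fall back to the global cache."""
    return [
        overrides.get(pkg, f"{cache}/{pkg}-{version}")
        for pkg, version in sorted(locked.items())
    ]
```

In the workspace above, nimbus.develop would contribute no overrides of its own but would inherit every entry from status.develop, so any Status package resolves to its sibling directory and everything else resolves to the cache.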

The Nimble commands associated with the develop mode are designed in a way that makes them easy to integrate with git as pre-commit or post-merge hooks (i.e. executed after git pull). The overarching goal of the lockfile feature is to make it impossible for you to push a broken build to your teammates, so the nimble check command will verify that all of your dependencies are in a clean and published state, that your committed lockfile is up-to-date and so on. On your daily pull, another command will help you get your working copies in a state that matches the fetched lockfiles.

Let's see how these features address the pain points from the wiki:

1. Nimble enforces package structure rules that are confusing and annoying to users

2. A package installed by Nimble does not match how the repo looks originally

@bobeff's work is relatively orthogonal to these, but it does preserve the conceptual separation between a repository and the installed package directory. This is considered an important feature if we want to support monorepos with multiple Nimble packages appearing within one. The src directory quirk is just handled behind the scenes by automatically specifying the correct added path for the package. Just like now, the nimble develop command can be used to create a complete development environment for the package in a local directory where you can run the tests, prepare a PR and so on.

3. Nimble doesn't allow a package to install files during or post-install nor uninstall packages cleanly if any files not installed by nimble are added after the fact

I don't think there are any functionality changes related to this, but the dependency traversal algorithms have been largely rewritten to support parallel downloads and installation of packages. Any development based on the current Nimble source files is likely to conflict with Bobeff's branch, so it may be wise to directly try to implement the changes there.

4. Nim is too aware of nimble implementation details

Addressed through my description of the minimal config files integration.

5. Nim and nimble use different algorithms to find dependencies

Addressed through the precise paths specified in lockfiles and config files.

cblake [2020-08-28T14:17:21+02:00]

I agree some other compile time error is likely and agree this new warning/error is only a nice-to-have...I thought I said that elsewhere but I cannot find it. Trying to keep it brief. :-) Anyway, the core point is optionality of package qualification unless it's necessary, just like module qualification and export however many names you want.

juancarlospaco [2020-08-30T21:51:07+02:00]

Rolling Release support.

Git short hash as version.

shashlick [2020-08-31T18:56:46+02:00]

@Araq - I've updated the wiki with a backwards compatibility section. Please review changes and let me know if you have any feedback.

shashlick [2020-08-31T21:05:30+02:00]

Responding to some points raised by @zahary.

Regarding package structure, I've added some info in the nim.cfg interop section detailing how multi-package repos can be handled with the same nim.cfg path mechanism. But the main motivation of these changes is to simplify the lives of library writers and not require them to fumble around with installExt and related settings or worry about the repo looking different after install.

As for adding support for installing packages from tarballs or other download mechanisms, it can certainly be added but can be discussed separately. Package integrity checking should not be affected by package structure as you have noted.

Parallel downloads sound great and so do any improvements around dependency handling but neither has anything to do with package structure. It is best to break all these improvements into individual PRs which can be reviewed and merged into Nimble master which has over 2k changes since @bobeff's last sync.

Last, the nimble develop enhancements as proposed seem complicated, especially from a user perspective. Today, you can develop multiple packages by just running nimble develop pkg1, pkg2, pkg3. By just adding a --recurse or similar flag, nimble could be instructed to set up all dependencies in nimble develop mode as well, in sibling directories.

Local deps mode might need a little more than that but adding a .nimble-link capability that extends to $nimbleDir itself might solve that puzzle. Introduction of a separate nimble.develop file along with 6 new CLI flags seems unwarranted. Maybe I'm missing something but the diff does not help with understanding the behavior intended. More importantly, I don't see the nimble develop behavior interfering with the package structure proposal.

cblake [2020-09-02T22:29:38+02:00]

It's not "pollution" IF there is any "Nimonic" way to package-qualify a module import, however that is done, any more than exported procs in a module "pollute" the global symbol namespace. I don't see anything incompatible about setting things up so qualification can work. It seems to me that whatever magic lets import foo work with foo-version/ installed should be adaptable. Some version has to be selected, but that is already the case.

The incredibly common "measured" case in the current nimble DB is no need for qualification. It's debatable if it will stay that way, of course. Unknown future evolution should not hold hostage present day usability.

dom96 [2020-09-03T00:29:32+02:00]

@cblake There is simply no easy way to achieve this optional-qualification, unless I am missing something. But even if it was easy I wouldn't want it to be possible. For Nim there is a perfectly good reason why function calls do not need to be qualified with the module name by default (UFCS), there is no similar good reason why you shouldn't qualify the modules you import by their package name.

shashlick [2020-09-04T01:37:50+02:00]

@cblake - this design is basically Araq's idea with all the consequences documented and mapped to the issues that will get resolved.

As for using git worktrees and space efficiency, as interesting as it sounds, it does not seem warranted since it will require quite significant changes to how Nimble functions and introduce additional git awareness into Nimble which will need maintenance. The value add is not significant compared to just having many independent clones. And for the rare large repos, the user can simply nimble develop a single directory.

@dom96 - I agree this proposal is not too different from what we have today as far as namespaces are concerned. Part of the effort here is to make it clear what nimble is pushing for while improving other packaging aspects in the identified issues:

Package doesn't change after nimble install

User does not need to deal with the nuances of binary/hybrid/library packages

Nimble installs everything, builds stuff if bin is defined, excludes stuff if skipX is defined.
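The "install everything, exclude what skipX names" rule can be sketched as a simple filter. This is a Python illustration only: the matching rules below (directory names matched anywhere in the path, extensions given without a leading dot) are simplifying assumptions, files_to_install is a hypothetical name, and Nimble's real skipDirs/skipFiles/skipExt semantics may differ.

```python
from pathlib import PurePosixPath
from typing import Iterable, List, Sequence


def files_to_install(
    files: Iterable[str],
    skip_dirs: Sequence[str] = (),
    skip_files: Sequence[str] = (),
    skip_ext: Sequence[str] = (),
) -> List[str]:
    """Keep every repository file unless a skip rule excludes it."""
    kept = []
    for name in files:
        path = PurePosixPath(name)
        if any(part in skip_dirs for part in path.parts[:-1]):
            continue  # file lives under a skipped directory
        if path.name in skip_files:
            continue
        if path.suffix.lstrip(".") in skip_ext:
            continue
        kept.append(name)
    return kept
```

With no skip rules the installed file list equals the repository file list, which is the "package doesn't change after nimble install" property.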

cblake [2020-09-04T11:41:17+02:00]

@dom96 - even if qualification is mandatory (which I think is non-Nimonic), there is still no good reason package structure cannot be all modules at the top directory level. nimble installs them in their own directory! No file collisions are possible. No "pollution" happens in the package or need happen in installs. Any trouble can only be caused by follow-on activity/relationships in nimble itself. Maybe this will be fixable in the new @shashlick world order.

Araq [2020-09-04T11:57:04+02:00]

Once again, as soon as we require import x / x rather than import x there is no "pollution" of module names. Nimble introduced the pollution, not Nim.

cblake [2020-09-04T13:04:19+02:00]

I don't think I disagree, but would reaffirm that optional qualification means there is no need to "require" anything. Add the right search path and import x/x is only needed if an importer suspects y/x may also be installed. Paranoid importers can always qualify. Non-qualifiers can always trivially add a qualification if a collision happens. There are more arguments for the way module name handling works than "just UFCS".

dom96 [2020-09-04T15:53:21+02:00]

Quoting Araq: "Once again, as soon as we require import x / x rather than import x there is no 'pollution' of module names. Nimble introduced the pollution, not Nim."

Right, but then this proposal should explicitly state that this is the goal and how we plan to achieve it. The rest of this proposal isn't as important (nor controversial).

shashlick [2020-09-04T23:32:00+02:00]

Big discussion on #nim today around this topic. If I summarize it correctly:

The global namespace issue is still being debated, with one side pushing to remove warnings in nimble and the other pushing for more enforcement by raising errors. The current implementation uses hard warnings; this proposal suggests a softer warning plus detailed documentation of how package namespaces could work best for everyone without requiring changes in either nim or nimble or breaking existing user code.

There is no easy solution to the problem - no way to optionally qualify only when there are conflicts. Either nimble has to tweak the package structure even more, which no one wants, or we break user code. If pkgname is injected or pkgname-0.1.0 is renamed to pkgname, we will still break on packages with srcDir declarations - --path won't work.

Meanwhile, two more nimble issues will be addressed by the proposal:

https://github.com/nim-lang/nimble/issues/608

https://github.com/nim-lang/nimble/issues/743

shashlick [2020-09-08T17:51:19+02:00]

Added two more issues related to the proposal.

https://github.com/nim-lang/nimble/issues/561

https://github.com/nim-lang/nimble/issues/576

Mirror of forum.nim-lang.org

6738 :: Nimble package structure and interop changes
