Hi everyone,
I've been working on nimhuml, a Nim implementation of the HUML parser and serializer.
What is HUML?
HUML (huml.io) is a serialization language for documents, datasets, and config files, created by Kailash Nadh. It looks like YAML but is intentionally stricter: one canonical way to write everything, no indentation ambiguity, no silent footguns.
What does nimhuml do?
Links:
This is still early (v0.2.0) so feedback, bug reports, and contributions are very welcome. Would love to hear from anyone who's been looking for a cleaner alternative to YAML in their Nim projects.
It's unclear why it parses into a JsonNode. I think it would be much better to create something like a HumlNode instead.
I'd guess because it's the same underlying data structure, and it makes it fit in with existing Nim codebases.
Which data languages is HUML isomorphic to (I mean translatable without losing information)?
HUML is basically a projection of JSON that looks vaguely like YAML, with a very unambiguous parser. You have to use :: for "vectors" rather than it being inferred from context.
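For illustration, here is a small fragment showing that distinction as I understand the spec from huml.io (the exact syntax details, such as comment placement, are my reading and may be slightly off):

```huml
# scalar values use a single colon
title: "my app"
# inline vectors use a double colon
ports:: 8080, 8081
# multi-line vectors also use a double colon
database::
  host: "localhost"
  port: 5432
```

The point is that a reader (or parser) never has to guess from context whether a key holds a scalar or a collection; the colon count states it explicitly.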
It's a language I've looked at in the past and concluded I wasn't very interested in. NestedText is very close to the YAML feel that people tend to actually use. The caveat is that its only data types are maps, lists, and strings, though every data type realistically passes through strings in production anyway.
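For comparison, a NestedText fragment (based on my reading of the NestedText docs; treat the details as approximate). Everything is a string until the consuming application interprets it:

```nestedtext
name: nimhuml
tags:
  - nim
  - parser
port: 8080
notes:
  > Even "8080" above is just a string;
  > the application decides whether it is a number.
```

That strings-only model is what keeps the spec thin: there is no type-inference surface for a parser to get wrong.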
The code looks like a direct translation of the Python version. It also looks possibly vibe-coded; maybe you used Claude? That's not a problem for me, as long as you provide a disclaimer and some kind of benchmarks.
It would be better for the Nim version to be split into files instead of a single large one; that makes it easier to maintain. I personally don't like clumping everything into a single file, as Nim is an expressive language.
I don't mean 30 files, and I don't know where you got that. And 30 lines per file is very small; I never suggested that.
I mean splitting files based on functionality, for maintainability and good practice. I didn't set any upper limit; my advice is to split based on function, not on lines of code.
I just mean splitting like this: nimhuml.nim (public API), parser.nim, writer.nim, and if needed errors.nim. That's it.
And 30 LOC per file? That's the kind of code some children write. I don't know why you concluded that from my opinion <3
And it's just a personal opinion; there's no compulsion.
Recursive descent parsers taking up hundreds of lines of code is to be expected. My current PEG repo parses Ford's PEG notation in ~900 SLOC, and that's genuinely irreducible. I'd prefer the files split by concern (reader, writer, document model), but my Nim coding style is particularly nonstandard.
HUML and NestedText are pretty thin specs, and if you're reusing the json module's document format, this isn't particularly offensive.
raise p.error("trailing spaces are not allowed")
This part is perhaps a bit anachronistic. Araq doesn't really want us using exceptions going forward. I actually prefer them (but then I've always been more Ada-aligned), yet this kind of bailout is discouraged these days.
Araq doesn't really want us using exception throwing going forward.
Not always, but in your case, sure: parsing errors are easy to make "keep going". Store the first error in the parser object and count further errors, then offer a real API for it. Something like that.
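A minimal sketch of that pattern, assuming a hypothetical parser object (none of these names are nimhuml's actual API): keep the first error, count the rest, and let the caller query the result instead of catching an exception.

```nim
# Hypothetical error-accumulating parser state; not nimhuml's real API.
type
  ParseError = object
    line: int       # line where the error was detected
    msg: string     # human-readable description
  Parser = object
    firstError: ParseError
    errorCount: int

proc recordError(p: var Parser, line: int, msg: string) =
  ## Keep only the first error in full detail; just count any that follow.
  if p.errorCount == 0:
    p.firstError = ParseError(line: line, msg: msg)
  inc p.errorCount

proc hasErrors(p: Parser): bool =
  p.errorCount > 0

when isMainModule:
  var p = Parser()
  p.recordError(3, "trailing spaces are not allowed")
  p.recordError(7, "tab used for indentation")
  doAssert p.hasErrors
  doAssert p.errorCount == 2
  doAssert p.firstError.line == 3
```

The call sites that previously did `raise p.error(...)` would instead call `recordError` and decide locally whether recovery is possible; the public API then exposes `hasErrors` and the stored first error.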
I'm still not sure why continuing to parse a failed document is seen as desirable. It can require contextual fix-ups that result in misunderstanding the document and generating error noise.
E.g. GCC encounters a typo, fails to understand it, assumes it's an integer, and keeps going, generating 30 more errors about all the things an integer can't do. Those errors are worthless because none of the code actually tried to do those things to integers.
It seems largely like parser writers just flexing.