nimforum mirror - [Review request] for rosetta code challenge "Lexical Analyzer"

greenfork (orginal) [2019-02-24T00:14:59+01:00] view original

I have successfully solved the challenge from http://rosettacode.org/wiki/Compiler/lexical_analyzer.

Currently this challenge is missing an example in Nim, I'm willing to suggest my solution there. I would like to ask you to review my code and possibly give me some advice on how to write better code. I'm new to Nim and free to suggestions.

The code is ~200 lines so please take a deep breath:

https://github.com/greenfork/rosetta-compiler/blob/master/lexical_analyzer.nim

View readme if needed.

leorize (orginal) [2019-02-24T17:20:06+01:00] view original

Great work!

Personally I think you should build one using lexbase instead of re as re seems to be too big just for this (the C version is written in plain C without re).

Actually if I could find sometime I'd try to build a lexbase version as it seemed fun :)

greenfork (orginal) [2019-02-24T19:43:01+01:00] view original

lexbase is totally the case! In the challenge it is also suggested to use one variant as a "raw" version and one with a lexing support from the language. I also wrote a version which reads one character at a time and it became very dirty very fast as well as it was hard to debug it, so the 2nd, current version is with re :)

Actually it's overwhelming how much support Nim has for parsing grammar, there's lexbase, re, pegs, strscans.

leorize (orginal) [2019-02-25T09:00:44+01:00] view original

I've spent sometime making the lexbase version: https://gist.github.com/alaviss/83a48f0344d55efbebcca3bce35e4157

This beast has ~310 LoC, so it's just ~100 LoC more than your re version :)

After writing mine, I noted a few points in your version that could be changed:

It's possible to assign a string to an enum, see my version and Manual#Enumeration Types. This should allow you to drop tkNames.

You should use object instead of tuple. Unlike other programming languages, object in Nim is fairly lightweight, and doesn't cause any slowdown compared to tuple, and also carries some perks like having a type constructor.

You should reduce the amount of return in your code. In Nim, return should only be used when you need the control flow semantic of it. If you wanted to return a result, use the result variable or an expression at the end of the proc.

Line 202: Use let please :)

P/s: If you are looking to submit your version to Rosetta Code, please submit mine as well, I'm rather lazy creating yet another account :P

Araq (orginal) [2019-02-25T12:24:36+01:00] view original

Sadly I cannot maintain it much, but shameless plug ahead: https://github.com/Araq/lexim

Lexim compiles regular expressions into no-overhead Nim matching code and can be used with lexbase.

greenfork (orginal) [2019-03-02T20:28:36+01:00] view original

I finally got my hands on it and now it's there http://rosettacode.org/wiki/Compiler/lexical_analyzer#Nim.

@leorize I would also like to say that I really appreciate your corrections, they were very useful, thank you.

Mirror of forum.nim-lang.org

4676 :: [Review request] for rosetta code challenge "Lexical Analyzer"