Lexer in Elixir. Lexer in Elixir. Lexer in Elixir. Lexer in Elixir. Lexer in Elixir. Lexer in Elixir. Lexer in Elixir. Lexer in Elixir. Lexer in Elixir. Lexer in Elixir.
Go to file
James Harton a36c3e9395
All checks were successful
continuous-integration/drone/push Build is passing
chore: Update forgejo hostname.
2024-02-05 15:40:53 +13:00
config chore: Update forgejo hostname. 2024-02-05 15:40:53 +13:00
lib feat: Allow actions to return a list of results. (#5) 2022-12-26 19:23:48 +13:00
test feat: Allow actions to return a list of results. (#5) 2022-12-26 19:23:48 +13:00
.drone.yml chore: Update forgejo hostname. 2024-02-05 15:40:53 +13:00
.formatter.exs chore: Run the elixir formatter. 2021-01-27 10:20:18 +13:00
.gitignore Add inch and inch_ci. 2017-04-27 08:04:07 +12:00
.tool-versions chore(deps): update dependency elixir to v1.16.1 2024-01-31 23:18:15 +13:00
CHANGELOG.md chore: Update forgejo hostname. 2024-02-05 15:40:53 +13:00
mix.exs chore: Update forgejo hostname. 2024-02-05 15:40:53 +13:00
mix.lock chore: add mix_audit. 2023-07-25 16:16:03 +12:00
README.md chore: Update forgejo hostname. 2024-02-05 15:40:53 +13:00
renovate.json chore: Configure Renovate (#6) 2023-07-25 18:27:48 +12:00

LexLuthor

Build Status

LexLuthor is a Lexer in Elixir (say that 10 times fast) which uses macros to generate a reusable lexers. Good times.

LexLuthor is a state based lexer, meaning that it keeps a state stack which you can push states on and pop states off the stack, which are used to filter the applicable rules for a given state. For example:

defmodule StringLexer do
  use LexLuthor

  defrule ~r/^'/,              fn(_) -> :STRING end
  defrule ~r/^[^']+/, :STRING, fn(e) -> { :string, e } end
  defrule ~r/^'/,     :STRING, fn(_) -> nil end
end

Rules are defined by a regular expression, an optional state (as an atom) and an action in the form of an anonymous function.

When passed the string 'foo', the lexer starts in the :default state, so it filters for rules in the default state (the first rule, as it doesn't specify a state), then it filters the available rules by the longest matching regular expression. In this case, since we have only one rule (which happens to match) it's automatically the longest match.

Once the longest match is found, then it's action is executed and the return value matched:

  • If the return value is a single atom then that atom is assumed to be a state and push onto the top of the state stack.
  • If the return value is a two element tuple then the first element is expected to be an atom (the token name) and the second element a value for this token.
  • If the return value is nil then the top state is popped off the state stack.

If lexing succeeds then you will receive an :ok tuple with the second value being a list of LexLuthor.Token structs.

If lexing fails then you will receive an :error tuple which a reason and position.

Contributing

  1. Fork it ( https://harton.dev/james/lex_luthor/fork )
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request