• Blue_Morpho@lemmy.world
    link
    fedilink
    arrow-up
    4
    arrow-down
    5
    ·
    8 months ago

    For instance you can’t write a regex that’ll relibly find the subject, object and verb in any english sentence

    Identifying parts of speech isn’t a requirement of the word parse. That’s the linguistic definition. In computer science identifying tokens is parsing.

    https://en.m.wikipedia.org/wiki/Parsing

    • notabot@lemm.ee
      link
      fedilink
      arrow-up
      7
      ·
      8 months ago

      That’s certainly one level of parsing, and sometimes alk you need, but as the article you posted says, it more usually refers to generating a parse tree. To do that in a natural language isn’t happening with a regex.

      • uranibaba@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        8 months ago

        Thanks for all the explaining. I always wondered why you can’t parse HTML since I first saw the Stack Overflow post, when you can take any HTML code you find and write an expression to work against said set of data.

        I never understood the word parse to mean understanding and building a structure based on any input.