RegEx - kbin.social

leobm, 13 days ago German

The ?x modifier/flag is nice, never used it before.
Makes it possible to include commentary inside complicated patterns. #perl #regex

original source: https://polar.sh/eval/posts/named-capturing-groups-in-clojure
#clojure
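
For readers who have not seen the flag in action, here is a minimal sketch in Python, where the same idea is spelled `re.VERBOSE` (the choice of Python and the date pattern are assumptions for illustration; the post itself is about Perl and Clojure). Whitespace in the pattern is ignored and `#` starts a comment:

```python
import re

# Hypothetical pattern for ISO-style dates; group names are illustrative only.
DATE = re.compile(
    r"""
    (?P<year>\d{4})   # four-digit year
    -
    (?P<month>\d{2})  # two-digit month
    -
    (?P<day>\d{2})    # two-digit day
    """,
    re.VERBOSE,
)

match = DATE.search("released on 2024-03-20")
print(match.groupdict())  # {'year': '2024', 'month': '03', 'day': '20'}
```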

chrastecky, 20 days ago

Lo and behold, fellow mortals of the programmer variety! I find myself embarking upon a most arduous quest.

One that shall test the mettle of my coding prowess... Yes, thee heard right.

I dare to dance with the elusive and powerful entity known as... the #regex #email #validation!

heiglandreas, 19 days ago

@chrastecky All right! 3 step email verification:

No '@', no email address

No MX entry in DNS for the part *after* the '@', no email address

No response to a link in an email to the email address, no email address.

Everything else might look like an email address, but it isn't.... 🤷
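
The three steps could be sketched roughly like this in Python. The DNS lookup and the confirmation-link check are passed in as callables because they depend on infrastructure the post does not describe; `plausible_email`, `has_mx_record`, and `link_was_clicked` are hypothetical names, not part of any library:

```python
from typing import Callable


def plausible_email(
    address: str,
    has_mx_record: Callable[[str], bool],
    link_was_clicked: Callable[[str], bool],
) -> bool:
    """Apply the three checks in order, stopping at the first failure."""
    # Step 1: no '@', no email address.
    if "@" not in address:
        return False
    # Step 2: the part after the last '@' needs an MX entry in DNS.
    domain = address.rsplit("@", 1)[1]
    if not domain or not has_mx_record(domain):
        return False
    # Step 3: only a response to a mailed link shows the address works.
    return link_was_clicked(address)


# Stub checks standing in for a real DNS query and a real confirmation mail.
print(plausible_email("user@example.org", lambda d: d == "example.org", lambda a: True))
```

Note that no regex is involved at all, which is the point of the post.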

heiglandreas, 19 days ago

@chrastecky And while

ändi@stella.maris.solutions

looks like a valid email address (if it doesn't, check your algo), it is about as valid as "hello world" as an email address.

But that might just be different expectations of what a valid email address is. 🤷

When your customer is happy with the above string being considered a "valid email address", then everything is fine...

lizardbill, 1 month ago

You can't parse [X]HTML with regex. #regex #html https://stackoverflow.com/a/1732454/1288

SirTapTap, 1 month ago

@lizardbill

villetakanen, 1 month ago

#regex #programmer #humour

NireBryce, 2 months ago

does there yet exist any application that can take a multi-selection and spit out a #regex that will match that in any file going forward?

necrosis, 2 months ago German

Dear computer science teachers, please teach your students #RegEx.

It helps enormously on the job. 😅
I wish I had already learned it in school. 🥹

#FediLZ

Linkshaender, 2 months ago

@hobbypaedagoge let me jump in here 😉
Searching (and replacing) text
Parsing log files
Validating data/input (no, not email addresses!)
Cleaning up data
Extracting information from text files

Usage tip: learn awk.

Computer science class: automata theory, formal languages, parsing

Regexes are a sharp sword, and like a sword, handling them should be practiced (see the XKCD cartoon)
@necrosis

mina, 2 months ago

@Linkshaender

See me whilst presenting common command line tools to Windows users:

@hobbypaedagoge @necrosis

alter_unicorn, 2 months ago

DON'T play.
really.

https://www.therobinlord.com/projects/slash-escape

#regex

alter_unicorn, 2 months ago (edited 2 months ago)

Did you?

VoronoV, 2 months ago

@alter_unicorn I don't know what this is about or what it means, but I answered YES to take part 😂

stux, 2 months ago

How to #REGEX

Powerfromspace1, 2 months ago

@stux accurate 😉

Wen, 2 months ago

@stux @nrmacdonald I find it helps with my more interesting emacs commands.

rdela, 3 months ago
Chat log from this morning’s #StaticChronicles with @zachleat + @mikeneu from @cloudcannon, in which I clumsily praise @paulcuth, @robb, and @bobmonsour among others!
https://gist.github.com/rdela/e8facf1a8a31ea5223c42075cbaa9bb2

Follow on Twitch
https://www.twitch.tv/cloudcannoncms

Subscribe on YouTube
https://www.youtube.com/@cloudcannon

Today's ep. https://youtu.be/Pt5CWtEPmBM

Bonus #RegEx I use to clean up the copy pasted Discord chat…
(?# Space out copy-pasted discord chat)  
(?# find )  
^(.+)\n:\s?  
(?# replace )  
\n$1:\n  
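
A rough Python equivalent of that find-and-replace, as a sketch only: the post does not say which tool or regex flavor was used, the sample chat text is made up, and Python writes the backreference as `\1` rather than `$1`:

```python
import re

# Made-up sample: each name sits on its own line, the message follows ": ".
chat = "alice\n: hello\nbob\n: hi"

# re.MULTILINE lets ^ match at the start of every line, not just the string.
spaced = re.sub(r"^(.+)\n:\s?", r"\n\1:\n", chat, flags=re.MULTILINE)
print(repr(spaced))  # '\nalice:\nhello\n\nbob:\nhi'
```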

rdela, 2 months ago

#StaticSiteFanClub 2024-03-20 More RSS Reading and https://youtu.be/T3D-nBsc_Sw

🔖 aka: #StaticSites with Zach and Mike; #StaticChronicles, #StaticFirst

with
@zachleat +
@mikeneu from
@cloudcannon

Chat log:
https://gist.github.com/rdela/275613be4921dfa81c131d9c6806f899

Follow on Twitch
https://www.twitch.tv/cloudcannoncms

Subscribe on YouTube
https://www.youtube.com/@cloudcannon

Cc @stefan @bobmonsour

rdela, 2 months ago

@zachleat @mikeneu @cloudcannon @stefan @bobmonsour speaking of @simevidas and RSS + OPML, have you seen https://github.com/simevidas/web-dev-feeds ??

maxleibman, 4 months ago

I’ve got an email-parsing project that will require some serious regular expressions.

It’s been a long while since I’ve written any regex. Can anybody recommend any good resources for putting off or avoiding doing it?

#procrastination #RegularExpressions #ReGex

kyleejohnson, 3 months ago

@maxleibman @aronow this won’t help you procrastinate, but https://regexr.com is one of my favorite tools once I’m close to the expression I want. It lets you test expressions on text you put in, so you can tweak your expression and see the result changes instantly.

benzucker, 4 months ago German

Any #regex wizards here?
Is there a way to match multiple linebreaks regardless of the content but only if the number of linebreaks exceeds a value like 5?

benzucker, 4 months ago

@barubary
Well there is most likely something nicer than this: \n.+\n.+\n.+\n

barubary, 4 months ago

@benzucker \n(.*\n){3}
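
A small Python sketch of the idea in this exchange, written with explicit `\n` escapes. The second pattern is an assumed variant for the case where only consecutive line breaks should count; the sample strings are made up:

```python
import re

# One newline followed by three "anything, then newline" groups:
# four line breaks in total, whatever sits on the lines between them.
four_breaks = re.compile(r"\n(?:.*\n){3}")

# Assumed variant: five or more line breaks directly in a row.
five_plus_in_a_row = re.compile(r"\n{5,}")

print(bool(four_breaks.search("a\nb\nc\nd\ne")))          # True
print(bool(four_breaks.search("a\nb\nc")))                # False
print(bool(five_plus_in_a_row.search("x\n\n\n\n\ny")))    # True
print(bool(five_plus_in_a_row.search("x\n\ny")))          # False
```

Raising the `{3}` or `{5,}` count adjusts the threshold.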

mgorny, 4 months ago Polish

The Python package #regex (not to be confused with the built-in re module) is built on CPython implementation details and does not work correctly with #PyPy (and the author has announced that he may eventually block building on PyPy). However, it looks like the #ReAssert package, which requires it, works without problems with plain re.

Today #Gentoo is moving from imperfectly patching the regex package, and ignoring the special cases in which it will not work, to patching re-assert instead. I would like to send this trivial patch to the author, but, as I have complained before, I was banned at some point; the author cannot say why, but that does not stop him from considering the ban justified. Maybe he just proactively bans Linux distribution devs.

https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=8413cf2c2955533fdf212fea3970c99cf193d4a1
https://github.com/mrabarnett/mrab-regex/issues/521
https://github.com/mrabarnett/mrab-regex/issues/404

#Python

snacktraces, 4 months ago

Need a regex for a MAC address?

On a project a while back I needed one and created it. Thought I would share here with everyone.

https://snacktraces.com/blog/regex-for-mac-address.html

#SoftwareDevelopment #regex

barubary, 4 months ago

@snacktraces does C++ not support [[:xdigit:]]?
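
As an illustration only, and not the pattern from the linked blog post, a MAC address check might look like this in Python. The assumed format is six pairs of hex digits separated by `:` or `-`; `[0-9A-Fa-f]` is spelled out because Python's `re` module has no `[[:xdigit:]]` class:

```python
import re

# Assumed format: six hex pairs separated by ':' or '-'.
# Mixed separators are accepted here; tighten the pattern if that matters.
MAC = re.compile(r"(?:[0-9A-Fa-f]{2}[:-]){5}[0-9A-Fa-f]{2}")


def is_mac(text: str) -> bool:
    """Return True if the whole string looks like a MAC address."""
    return MAC.fullmatch(text) is not None


print(is_mac("00:1A:2b:3C:4d:5E"))  # True
print(is_mac("00:1A:2B:3C:4D"))     # False, only five pairs
```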

jpaskaruk, 4 months ago

What else is as incredibly impressive, and at the same time as horrifically ugly, as #RegularExpressions?

By the way, if you need to detect #chords in a text chord chart, here is the #regex you need:

"A-G?(maj|min|m|M|+|-|dim|aug)?[0-9|11|13](sus)?[0-9|11|13](add)?[0-9|11|13]*(/A-G?)?"

edit: any #Musicians out there, can you think of any edge-case chords I should test/adjust to catch? This will be part of a Free chord chart organizer, hit me with your worst.

#Programming

jpaskaruk, 4 months ago

@barubary

I reached that with some modifications to something I found somewhere, and the impression I've got is that you can use | as a logical OR within a set like that.

I could be wrong there, I'll check up on that. But in the meantime, a real musical chord that fails to properly match against the regex will be more useful to me.

For the moment I'm considering it a solved problem, cause now I need to put on my javascript wetsuit and implement this into a web ui...

https://github.com/dnotes/markdown-it-chords

barubary, 4 months ago

@jpaskaruk | only means OR outside of a set like that. Within a [ ] set, every character is already OR'd (e.g. [abc] matches a or b or c, and [a|b] matches a or | or b).
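
This is easy to check in Python (used here purely for illustration; the behavior is the same in most regex flavors):

```python
import re

# Inside [ ], '|' is just another literal character in the set.
in_set = re.findall(r"[a|b]", "a|b|c")
print(in_set)  # ['a', '|', 'b', '|']

# Outside a set, '|' separates alternatives.
alternation = re.findall(r"a|b", "a|b|c")
print(alternation)  # ['a', 'b']

# So [0-9|11|13] is the same set as [0-9|]: one digit or one '|'.
print(bool(re.fullmatch(r"[0-9|11|13]", "|")))   # True
print(bool(re.fullmatch(r"[0-9|11|13]", "11")))  # False, two characters
```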

youronlyone, 4 months ago

To my fellow #ActuallyAutistics who are also into programming. Can you handle #RegEx / #RegExp?

Up to how much complexity?

When I was younger, it was easy. Today, I have to use a test tool! ^_^;;

#Autistics #Autism #ActuallyAutistic #Autistic #AutismSpectrum #AskingAutistics

@autistics @actuallyautistic

nis, 4 months ago

@youronlyone @autistics @actuallyautistic
Anything other than '.' and parentheses, I have to google.

youronlyone, 4 months ago

@RoundSparrow I understand that, I don't like memorisation because I'm bad with it. I'm more of, the more I use it, the more I'll remember it, not because I memorised it.

Another thing (although off-topic), the castle memory technique, usually attributed to memorisation. But, I don't know, for me, it's a storage technique. If I don't pull out a memory from its storage, I won't even remember it.

@autistics @actuallyautistic

danrot, 5 months ago

Generally I like #RegEx, but there are two huge problems for me with it:

1️⃣ I don't need it often enough, making it hard to remember more complex stuff.
2️⃣ As if 1. would not be bad enough, every tool and language uses a different dialect of it 😩

danrot, 5 months ago

@dantleech Are you talking about vimgrep? Not even using that, configured to use rg instead. But rg does not support lookbehinds, which I had to use today 🙈 At least not unless you set another flag 😕

dantleech, 5 months ago (edited 5 months ago)

@danrot no just standard :s/foo/bar/g :)

stealthmusic, 6 months ago

#IntelliJ always succeeds in surprising me. It has a built-in #RegEx tester that allows testing and changing the expression directly in your code, dealing with all the nasty escaping that is required in Java. It even highlights matching groups. 💚

#swdev #DEVCommunity #software #Java #JavaBubble #AdventOfCode

hennell, 6 months ago

This is pretty neat. A better way to #regex in #php? 🤔

https://github.com/gherkins/regexpbuilderphp

hennell, 6 months ago

@kboyd @emd

"It's fair to argue that this is one of those places. But it's also fair to argue that this is not one of those places."

I laughed at this, but totally agree with it.
It's very much a 'weigh up the pros and cons' for your use case situation.

kboyd, 6 months ago

@hennell @emd It's a similar scenario to the "Qed" BCMath wrapper library I started writing this week, although as a mere chainable interface (and not a fluent DSL interface) mine is a fair bit less complex.

There are times when it might help, and times when it would not provide a benefit.

https://github.com/beryllium/Qed

No users yet, mind you. And a library with no users may be of little value ... until that first user needs it and reaches for it.

jas_hughes, 6 months ago

Day 1 of #AdventOfCode

Like others, found this one much harder than other Day 1 puzzles: especially with that tricky #regex edge case that wasn't in the examples.

I try to stick with base #Rstats in my solutions, and it wasn't so elegant for extracting strings.

jas_hughes, 6 months ago

Day 2 of #AdventOfCode

I really like when I can re-use the same code for parts 1 and 2, passing different functions as arguments to differentiate the solutions, so I was satisfied with this one.

Screenshot of R code:

    instr <- gsub(".*: ", "", readLines("02.txt", warn = FALSE))
    maxes <- c(red = 12, green = 13, blue = 14)

    part1 <- function(ns, cols) all(ns <= maxes[cols])
    part2 <- function(ns, cols) {
      max(ns[cols == "green"]) * max(ns[cols == "red"]) * max(ns[cols == "blue"])
    }

    check_game <- function(s, summary_function) {
      rounds <- strsplit(s, ";")[[1]]
      pulls <- unlist(lapply(rounds, strsplit, ","))
      cols <- gsub(".* ", "", pulls)
      ns <- as.numeric(gsub(" *(\\d+).*", "\\1", pulls))
      summary_function(ns, cols)
    }

    message(
      "Part 1: ",
      sum(which(vapply(instr, check_game, part1, FUN.VALUE = logical(1))))
    )
    message(
      "Part 2: ",
      sum(vapply(instr, check_game, part2, FUN.VALUE = numeric(1)))
    )

jas_hughes, 6 months ago

Day 9 of #AdventOfCode

Speedy part 1, especially for a recursive approach for me (not something I do often).

But then I spent way too long trying to implement a reverse recursion function before realizing I could just reverse the array (screaming).

hamatti, 6 months ago

⭐️⭐️

First day of #AdventOfCode in the bag with two stars!

Today I used #RegEx with my #Python solution:

https://github.com/Hamatti/adventofcode-2023/blob/main/src/day_1.ipynb

#AdventOfCode2023

paulox, 6 months ago

@hamatti I've just read your notebook. Great work. I appreciated your explanation of the solution and your mental process in solving the puzzle. I have to admit that I solved day 1 in a very similar way
https://github.com/pauloxnet/adventofcode/blob/main/aoc2023/day01.py

sabret00the, 6 months ago

If I have a string and want to match all characters between the 10th character and the 48th character, what is the proper #regex for that? [A-Z0-9]{10,48} doesn't work 😭
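
One possible reading of the question, sketched in Python: `[A-Z0-9]{10,48}` asks for a run of 10 to 48 such characters anywhere in the string, whereas selecting by position means skipping the first 9 characters and capturing up to 39 more. The file name below is hypothetical, and whether a given rename tool supports this syntax is an assumption:

```python
import re

# Skip the first 9 characters, then capture up to 39 more (positions 10-48).
TENTH_TO_48TH = re.compile(r"^.{9}(.{1,39})")

name = "01 - Artist Name - Album Name - Title.mp3"  # hypothetical file name
match = TENTH_TO_48TH.match(name)
print(match.group(1))

# Plain slicing selects the same characters when no regex is required.
print(name[9:48])
```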

sabret00the, 6 months ago

@barubary I was renaming some music files. But they were named as "0X - Artist Name - Album Name - Title.mp3" and the easiest way to rename them in a batch was via Solid Explorer using the REGEX function.

barubary, 6 months ago

@sabret00the Ah, I see.

vwbusguy, 7 months ago

Solved a *problem with #regex with more regex today.

*Not actually a problem with the regex itself, but one of unclear business requirements, but for anyone that said I'd regret the DNS regex I wrote a month later, I ate that soup today and it honestly wasn't bad.

linux_mclinuxface, 7 months ago

@vwbusguy and how many problems do you have now? Say it with me … that’s right: 2 problems.

vwbusguy, 7 months ago

@linux_mclinuxface No, I have \d+? problems.

themeowcate, 8 months ago French

#Dev #Code

My manager: "I need to understand. I sent you that big file of messy, regurgitated data and you delivered a nice clean CSV, sorted and filtered. Could you pass me the script you used to do that?"

Me: "Oh, but I don't have a script."

Him: "But how did you do it?"

Me, all proud: "It's the power of REGEX!"

I love regexes. Regexes solve everything! Hey, I know, I'll write an HTML parser in regex!

#regex

adarr_volte, 8 months ago

@themeowcate maybe one day I'll get there; in the meantime they drive me up the wall...
Hats off if you've mastered them.

themeowcate, 8 months ago

@pasqualeberesti
"You're a wizard, Pasquale"

vwbusguy, 8 months ago

The thing about coding with #regex is that it feels like I'm getting paid to do Sudoku puzzles for a living.

Tip for those who are asked to review code with regex: Rather than focusing on the regex itself, ask to see the automated tests that it is run against and look for gaps in the tests, rather than getting lost in the weeds scrutinizing the regex, unless there's an obvious significant performance problem.

vwbusguy, 8 months ago

@sudoedit This is part of why named groups are useful. You're just chaining starts and ends until you reach the eol or eof.

barubary, 8 months ago

@vwbusguy My advice is essentially the opposite. Focus on the #regex, at least to get started. Regexes are code. Just like any other programming language, you have to learn the syntax and practice a bit, but the same principles apply as with program code in general.

When reviewing code, start by reading it. If there's something unclear, ask about it. Don't accept a regex consisting of 100 characters in one line without a single space. Compared to most other languages, regex syntax is terse: Few (if any) keywords, lots of symbols. Divide complex regexes into simple parts that are assembled into bigger constructs. You probably wouldn't accept a patch that adds hundreds of lines of unfactored code that has complex logic and nested loops, but no indentation or whitespace and no functions, so why write your regexes this way?

If your language builds regexes from strings, use string concatenation, formatting/indentation, comments, and named variables to make the structure of the pattern clear. If your language has the /x modifier, use it to allow sensible formatting and comments right in the regex (remember to escape with `\` or `[ ]` any spaces that should match literally). If your language supports (?(DEFINE)...) and the (?&foo) syntax for named "regex subroutines", consider using it (but also consider restructuring your code: it might be trying to do too much in a single regex).
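
A minimal sketch of that advice in Python, assuming a made-up `host:port` format; `OCTET`, `IPV4`, `PORT`, and `ENDPOINT` are hypothetical names chosen for the example. Small named pieces are assembled with f-strings, and `re.VERBOSE` plays the role of /x:

```python
import re

# Small named building blocks (hypothetical names for illustration).
OCTET = r"(?:25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)"   # 0-255, no leading zeros
IPV4 = rf"{OCTET}(?:\.{OCTET}){{3}}"              # four octets joined by dots
PORT = r"\d{1,5}"                                 # digits only, range unchecked

ENDPOINT = re.compile(
    rf"""
    (?P<host> {IPV4} )   # dotted-quad address
    :                    # literal separator
    (?P<port> {PORT} )   # port number
    """,
    re.VERBOSE,
)

m = ENDPOINT.fullmatch("192.168.0.1:8080")
print(m.group("host"), m.group("port"))  # 192.168.0.1 8080
```

Each piece can be read, tested, and reused separately, which is much harder with the assembled pattern written out on one line.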

Once you understand the structure of the regex and how it is meant to work, it becomes much easier to review the tests: Are there any? Do they cover every input variant, exercising all parts of the regex, both matching and failing? (Failing matches are also relevant for finding performance issues: If a regex finds a match, it usually does so quickly. But a regex with exponential backtracking can take forever to fail because it'll try a huge number of variations before giving up on a string that doesn't match.)
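
A table of cases is one way to make that coverage visible. The sketch below is in Python and uses a hypothetical `major.minor.patch` pattern; it lists matching and failing inputs side by side, so a reviewer can spot which branch of the regex has no test:

```python
import re

# Hypothetical pattern under review: "major.minor.patch", no leading zeros.
VERSION = re.compile(r"(0|[1-9]\d*)\.(0|[1-9]\d*)\.(0|[1-9]\d*)")

# Each case pairs an input with whether the whole string should match.
CASES = [
    ("1.2.3", True),
    ("0.0.0", True),       # exercises the "0" branch of each group
    ("10.20.30", True),    # exercises the multi-digit branch
    ("1.2", False),        # missing component
    ("01.2.3", False),     # leading zero
    ("1.2.3.4", False),    # extra component
    ("", False),           # empty input
]


def failing_cases():
    """Return the cases where the regex disagrees with the expectation."""
    return [
        (text, expected)
        for text, expected in CASES
        if (VERSION.fullmatch(text) is not None) != expected
    ]


print(failing_cases())  # [] when every case behaves as expected
```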

There is an infamous regex for RFC 822 email addresses out there on the internet[1]. It is thousands of characters long and utterly incomprehensible. However, it was not written manually: It is essentially "object code", assembled by commented code using string concatenation from named variables that follow the structure of the BNF grammar in the RFC. Strive for the latter, not the former.

[1] http://www.ex-parrot.com/~pdw/Mail-RFC822-Address.html

villares, 8 months ago

"I hate #regex, but I think this worked fine. I used #regexxer, a helper to find and replace stuff on multiple files, for those [of us] less well versed with the traditional CLI regex workflow."

Any other tips for user friendly find-and-replace tools?

https://github.com/py5coding/py5generator/issues/350#issuecomment-1752025818 #Python #py5
