Deykun, (edited )

There were some comments on Reddit suggesting that cutting the dataset at 15 and removing 40% of words was not the best move. I have locally built a version with the limit set to 30.

For the interested:
https://imgur.com/a/Rz7Cw6x

einfach_orangensaft,

E

HenryWong327,

E

sebsch,

Cryptography class flashbacks

Halcyon,

And Wheel of Fortune.

Aiyub,

But why is it not a German keyboard layout?

eldain,

It is, I’m using it: www.neo-layout.org

It is just not able to break the habit of typewriter compatibility.

UpsKaputt,

They were asking why the heatmap pictures in the OP were not superimposed on a German keyboard layout.

They were not asking why no one has made a keyboard layout out of the heatmap data :)

Just thought I’d clarify, because you’re basically having two different conversations at this point.

eldain,

Oops, I see my confusion. Thanks.

Aiyub,

That one fits the pictures even less.

lugal,

German layout is QWERTZ while English is QWERTY and your neo layout is neither

eldain,

I know, but it used the above heatmap to optimize for speed by putting the hottest keys for German on the home row. It is not the standard keyboard. I learned to use it decades ago, when I was a little German high school nerd and had too much time.

Oka,

Seen

d_k_bo, (edited )

Great project! I really like your design language! Though it would be nice if there was a dark mode and if it supported https://developer.mozilla.org/en-US/docs/Web/CSS/@media/prefers-color-scheme.

Deykun,

Thanks. It should read prefers-color-scheme. I get dark mode by default, and it's also possible to set dark or light mode manually.

d_k_bo,

Oh, somehow I missed the theme setting. I tried both Firefox and Chromium and got the light theme by default, despite having my system and browser settings set to prefer a dark theme.

GravitySpoiled,

A million words doesn’t sound like a lot

Deykun,

To clarify, it is not the total number of words but the number of unique words considered. IMHO, a million unique words is okay. A bigger concern for me would be that words on Wikipedia can be overly specific.

sbv,

That million words sounds like a lot.

GBU_28,

Have you considered a similarity search approach? It would handle your oddly specific synonym issue.

Deykun,

I only have a pre-spellchecked list of words from here: http://www.aaabbb.de/WordList/WordList_en.php

GBU_28,

Oh? Name all of them.

KISSmyOS,

A,
A-a,
Aachen,

Localhorst86,
Anekdoteles,

It’s incomplete, as you will only find 95% of words used on ich_iel.

Deykun,

A source: https://deykun.github.io/diffle-lang/de?p=about-language (It has tooltips displaying percentages for other letters)

d_k_bo,

Anzahl der Buchstaben in einem Wort

(Number of letters in a word)

For this metric, Wikipedia might not be a representative dataset. Wikipedia uses many technical terms and compound words, which tend to be longer than the words common in everyday speech.
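
One way to check this would be to compute the same letter-count distribution for two word lists (say, one from Wikipedia and one from everyday text) and compare them. Below is a minimal sketch, assuming each source is available as a plain iterable of words. The function name `length_distribution` and the sample list are made up for illustration and are not taken from the project.

```python
from collections import Counter


def length_distribution(words):
    """Return {word_length: share_of_unique_words} for an iterable of words.

    Words are lowercased and deduplicated first, so the result describes
    unique words, not total occurrences.
    """
    unique = {w.strip().lower() for w in words if w.strip()}
    counts = Counter(len(w) for w in unique)
    total = sum(counts.values())
    return {length: counts[length] / total for length in sorted(counts)}


# Hypothetical sample; a real comparison would load two full word lists.
sample = ["Haus", "haus", "Baum", "Donaudampfschiff", "ich", "und"]
dist = length_distribution(sample)

for length, share in dist.items():
    print(f"{length:2d} letters: {share:.0%}")
```

Running this on both datasets and comparing the shares per length would show whether the Wikipedia list is skewed toward long words.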
