More

omegalulw · on Oct 15, 2022

> The old app that I wrote would happily take JavaScript or SQL snippets as inputs to any text field and do The Right Thing.

Confused here. Where were input validation checks in your implementation? How did you guard against SQL injection, etc?

Winsaucerer · on Oct 15, 2022

Presumably they just did whatever the standard provided mechanisms for their SQL driver were (such as parameterised queries). User inputs text in a comment box, and you insert it into database using such a mechanism and it's safe.

And if you're using, for example, Go's templating library, then it automatically escapes everything in HTML templates unless you explicitly override this default behaviour.

paulryanrogers · on Oct 15, 2022

Well if it was only 100 lines of plain JS then how would one guard against reflection attacks? I.e. submitting HTML (like script tags) then getting that to render when others view the tainted data.

Winsaucerer · on Oct 15, 2022

Because on this way of building sites, the user submitted data is escaped before it reaches the browser. E.g.: https://go.dev/play/p/MmNSxU5QfAb (hit run to see the output).

The JS wouldn't need to do any escaping, because it's not trusted to handle any unescaped data. It's operating on the already-escaped html template.

skeeter2020 · on Oct 15, 2022

They certainly weren't using Go, or as stated, any framework. Also no mention of any type of web server; not sure what magical code was creating dynamic HTML from the database. Where was the business logic? Stored Procedures? No mention of more dynamic functions... No integrations... Sure sounds like a desktop browser-only app while the majority of the world today wants some mobile functions from almost every system.

There is a lot information, which is understandable but also conveniently supports a very unflattering narrative while simultaneously promoting the OP's awesomeness.

Winsaucerer · on Oct 15, 2022

I think you're reading them far too strictly. I don't think they literally meant they were using nothing beyond JUST the SQL Server and then somehow getting HTML out of that, with 100 lines of JS on top. Unless I misread, I don't see anything that implies they weren't using something like PHP or ASP, for example.

P5fRxh5kUvp2th · on Oct 16, 2022

To me this is the hilarious part.

Q: "how do they use the workarounds needed to secure the more complex approaches?"

A: "those security concerns don't exist in the approach, no workaround needed. That's part of the simplicity".

It just represents a fundamental misunderstanding, but it's not their fault, they've never seen anything else. Like someone using a JWT instead of a session cookie.

eurasiantiger · on Oct 17, 2022

>Like someone using a JWT instead of a session cookie

Those aren’t mutually exclusive, you know.

thow232329 · on Oct 15, 2022

Just put the queries in procedures with parameters. Only store the procedure calls in your backend, disable arbitrary queries completely in your database permissions.

omegalulw · on Oct 14, 2022

Can't have context if the file never loads.

MobiusHorizons · on Oct 14, 2022

Sounds like you need a better editor :)

But seriously, I would expect most editors to handle large files relatively gracefully, even if they aren’t as fast as vim for example. Browser based code viewers on the other hand, I grant you, aren’t generally up to the task.

omegalulw · on Oct 13, 2022

This seems...wrong?

Let's say you can control voltage, cross section area A.

Now, you want a constant power across a load P = V * I.

Your net resistance R is C + D * L / A where C and D are constants and L is your cable length.

I is proportional go V * A / (C * A + E) where E is also a constant.

So your load power is proportional V^2* C * A / (C * A + E) which is proportional to V^2 * A / (A + F) where F is also a constant.

With a large "enough" A, this is effectively V^2. With a small A, this is V^2 * g where g is A / F. So the smaller the area you have the more power you are wasting (roughly equal to V^2 * (1-g) which is heat in the wires).

So the smaller area you have, the less efficient your power delivery. And juicing up your source power is a lot more expensive than juicing up your source voltage.

omegalulw · on Oct 10, 2022

Maybe I'm missing something obvious, but if I insert all 32 bit keys, does that not take 32*2^32 space?

desertrider12 · on Oct 10, 2022

Nope, there will just be 2^8 tables (constant overhead per table) each with 2^24 24-bit keys.

If you start with an array of any N keys, they have 32N bits of information. But once you insert them all into an unordered set like this, you throw away information by losing the order. There are only (2^32 choose N) unique sets of N keys, which is less than N*2^32 for N>1. This checks out in your example - there is only one unique set of all the 32-bit integers, so that should take log2(2^32 choose 2^32) = 0 bits to store sets of this size.

fargle · on Oct 10, 2022

it's not mean for storing all possible 32 bit integers. if you want to do that, it would take zero storage because the answer is just "yes".

what they are doing is using the fact that a hash table can be loaded to 80% or 90% and still work fine. but they can encode part of the data (the first byte) implicitly in the hashed address. it's kind of cheating because the user provides those bits back to the lookup function to get their answer, so they don't need to be stored. and saving one byte out of four means you store 75% which is a bigger gain than the loss due to the 80%-90% load-factor.

say you have a normal hash table with 90% load factor, and you need to store 100,000 32 bit words, it would take 32 * 100,000 / 0.9 = 3555555.

but if you use this (or some related) trick and only save part of the key, you'd have 24 * 100,000 / 0.9 = 2666666. which is even less than 32 * 100,000 if you used direct storage in a simple array.

omegalulw · on Oct 10, 2022

To be fair, I'm not sure if you can demodulate a 100MHz FM signal digitally. DSP at this frequency on an Arduino maybe be infeasible (you need a phase locked loop). I suppose if you used a bandpass + low pass filter you can pull it off but that will be a lot more work still.

omegalulw · on Oct 7, 2022

Wrong way to think about the problem. Lowering bits does not mean lowering model capacity. Its the opposite in fact - it allows you to you to fit more parameters.

pfortuny · on Oct 7, 2022

Well, yes within your constraints. In the end, you are choosing between two aspects. Same as with screens: you could have resolution (1024x768!!) or color (16bits!).

Edit: the term I could not remember is tradeoff.

adgjlsfhk1 · on Oct 7, 2022

kind of. if we get computers that are 1000x faster, it just becomes a tradeoff between higher precision or 1000x more parameters. the reason resolution has stopped being pushed is that our eyes have severe diminishing returns. it's not yet known whether brains do.

omegalulw · on Oct 7, 2022

This is not that relevant for ML. Each gradient pass will re-compute your cost function and the gradients so errors are not likely to accumulate. The main thing is to not make errors big enough that you end up in a completely different part of the parameter space derailing progress which is what the above commenter points out.

thesz · on Oct 7, 2022

It is extremely relevant for ML.

I am familiarizing myself with recurrent neural networks and getting them trained online is a pain - I get NaNs all the time except for very small learning rates that actually prevent my networks to learn anything.

The deeper network is, the more pronounced accumulation of errors in online training is. Add 20-30 fully connected (not highway or residual) layers before softmax and you'll see wonders there, you won't be able to have anything stable.

emn13 · on Oct 7, 2022

This isn't true in general. Very specific ML algorithms that were likely developed with years of blood and sweat and tears may have this kind of resiliency, but I've been in the the numerical weeds enough here that I wouldn't bet on even that without a real expert weighing in on it - and I wonder what the tradeoff is if it's true there. It's very easy to have numerical stability issues absolutely crater ML results; been there, done that.

I have some ~15 year old experience with the math behind some of this, but actually none with day-to-day deep learning applications using any of the now-conventional algorithms, so my perspective here is perhaps not that of the most pragmatic user. The status quo may have improved, at least de facto.

kelseyfrog · on Oct 7, 2022

I'm not really sure there is evidence for that. In fact, depending on your interpretation of why posits[1] work, we may even have empirical evidence that the opposite is true.

1. https://spectrum.ieee.org/floating-point-numbers-posits-proc...

omegalulw · on Oct 7, 2022

You want to ask the author to change the commit you are interested in and then review the diff to that commit.

This is if course mostly valuable if you don't squash commits on merge. Otherwise, the extra rebase work isn't that valuable.

omegalulw · on Oct 7, 2022

That was my question to. 8cc is doing the heavy lifting of compiling C.

omegalulw · on Oct 5, 2022

I wonder how well WPA3 fares here.