There are over 600 lines of test code for every line of code in SQLite
Holy mother of god
And they still find new bugs
A limitation of testing is that you can only write tests for cases that you can think of, and cases you can think of ways to write tests for.
It’s still valuable despite this limitation, of course.
That’s not entirely true, e.g. you can do fuzz testing or constrained random testing. Maybe you aren’t including those in “testing”?
I was mostly thinking about hand-written tests and manual test procedures, but yeah, fuzzing can help you catch issues as well and you don’t necessarily consciously know about the test cases you put into the system in that case.
Then again, you have to design the fuzzing input consciously so I guess that’s kind of a “what you can think about”-limitation.
Good point regardless, thanks
You can write shit tests. Finding new bugs doesn’t surprise me. Putting that much effort in does, but 600:1? That’s some serious red flags there. There are only so many variables in a single line of code. How many unhappy paths can there be for a single line?
SQLite is one of the best tested codebases in existence. Having only so many variables per line means nothing
I get it…I’ve never been the maintainer of a codebase that’s deployed on trillions of devices, and backwards compatibility is something to be taken seriously and responsibly when you’re that prolific. I do not begrudge SQLite or any large projects when they make decisions in service to that.
However
It always makes me feel oddly icky when known bugs (particularly of the footgun variety) become the new standard that the project intentionally upholds.
I’m so confused that the same people can say “why does everyone get their undies in a bunch that we happily accept putting arbitrary data in columns regardless of type, that’s good, it’s flexible, but fine, we’ll put in a ‘strict’ keyword if you really want column types to mean something” and also “every other SQL says 1==‘1’ but this is madness, strings aren’t integers, what is everyone else thinking?!”
Strongly typed is orthogonal to strictly typed, so these two properties alone are not contradictory.
However, it is a bit unsettling that, if a column has an INTEGER type affinity, and you try to put a string in it, then the string is implicitly converted to an integer if it represents an integer and just stored silently as-is otherwise.
But they silently converted the string ‘1’ into the number 1. So now in my same code, I want to select back my stringy ‘1’ that I putted in the type affined INTEGER column.
And you are telling me its normal that I don’t get it back ? Or maybe I’m misunderstanding something?
To clarify: I am saying that it is not “normal” that the type you get back out is not only not necessarily the type that you put in, but may be different depending on the value that you put in. Put another way, sqlite is strongly typed unless you mistakenly thought that type affinities by themselves made it be strictly typed, in which case it becomes neither strictly nor strictly typed.
neither strictly nor strictly typed.
I think one of them should be “strongly”, but I understood your point.
Thinking back, I don’t have the doc easily accessible (on phone), but I think the C API state the type you want to read. Like
get_int(smt, VALUE_INDEX, …)
, so at least in the C API, most of this should not be visible. Maybe only theSELECT 1 = '1'
part (or others comparaison fully done in the SQL string)?
It’s not on trillions of devices, just billions. But e.g. a typical android phone has 1000s of sqlite db’s for different purposes.
You’re right, that’s a distinction I failed to make
I had no idea the maintainers of sqlite were religious fanatics.
As long as they don’t go on a holy crusade or forcefully evangelize the entire world by genocide I wouldn’t call them fanatics.
Yeah, the code of ethics thing shocked me as well, but it’s really important to remember that it’s not something they expect users to follow, it’s something the devs are saying they’ll follow. So it’s not too bad in my opinion.
Why do we even need a server? Why can’t I pull this directly off the disk drive? That way if the computer is healthy enough, it can run our application at all, we don’t have dependencies that can fail and cause us to fail, and I looked around and there were no SQL database engines that would do that, and one of the guys I was working with says, “Richard, why don’t you just write one?” “Okay, I’ll give it a try.” I didn’t do that right away, but later on, it was a funding hiatus. This was back in 2000, and if I recall correctly, Newt Gingrich and Bill Clinton were having a fight of some sort, so all government contracts got shut down, so I was out of work for a few months, and I thought, “Well, I’ll just write that database engine now.”
Gee, thanks Newt Gingrich and Bill Clinton?! Government shutdown leads to actual production of value for everyone instead of just making a better military vessel.
another one not mentioned there: sqlite is really tiny: (from https://sqlite.org/faq.html#q18 )
The default configuration of SQLite only supports case-insensitive comparisons of ASCII characters. The reason for this is that doing full Unicode case-insensitive comparisons and case conversions requires tables and logic that would nearly double the size of the SQLite library.
I’m glad to see i’ve been pronouncing it right all these years.
Unfortunately, you’re both wrong 🙂
It’s pronounced “gif”
like the peanut butter?
SKIPPY
like the clear soda?
Yeah I was really surprised by that. Surely “sequelite”, given SQL is commonly pronounced “sequel” (c.f. PRQL).
ass-keh-leet
Their commitment to backwards compatibility, to the point of keeping a known bug that allows primary keys to be null, is both amazing and “wtf”.
You can do backwards compatibility and make breaking changes to fix bugs. All you need is an opt-in “target version”. CMake and Android are good examples of this.
Hmm, well… I have never murdered anyone, not even once! Is that good enough for their Code of Ethics?
Here’s a fun fact not noted in the article: Temporary files in sqlite are named etilqs_something in order to prevent people from contacting the sqlite developers for support when other applications (specifically, McAfee) have decided dump and not prune temp files.
Source: https://github.com/sqlite/sqlite/blob/95f6df5b8d55e67d1e34d2bff217305a2f21b1fb/src/os.h#L57
Here’s a fun fact not noted in the article:
It’s #19 in the article.
Well, I can’t read I guess.
At least I linked to the code, since the article doesn’t seem to do that. The twitter thread it linked to probably does, but I can’t view the replies without logging in.
At least I linked to the code,
I appreciate that. :)
So nerdy, so good
This article is written as though it is targeting FOSS newbie or something – a weird mix of jargon and simple language designed to overawe someone.
Their VCS is at least as interesting as SQLite :)
deleted by creator
#17 mentions strict and links to its documentation.