blog.goeswhere.com
Faux' blog: 2011/10/
https://blog.goeswhere.com/2011/10
Diagnosing character encoding issues. Natural language is horrible. Unicode is an attempt to make it fit inside computers. I'm going to make up some terms. A group of related lines representing what English people would call a letter. A group of related lines that might be stand-alone, or might be combined to make a symbol. And use some existing, well defined terms. If you use one of these wrongly, people. A number between 0 and 1.1 million that uniquely identifies a glyph. Nothing to do with. And copy t...
blog.goeswhere.com
Faux' blog: 2012/05/
https://blog.goeswhere.com/2012/05
HTTPS (HTTP over TLS. Is the most accessible form of encryption for end users. It protects against real annoyances. I believe it's probably the most important thing to advocate, even among developers. Will run your search results over TLS. Some websites, like Facebook. Allow you to specify that you always want to use HTTPS. You should. Is a Chrome/Firefox extension that tries to upgrade your connection to HTTPS on any website where it's available. HTTPS (HTTP over TLS) is free. Please mock anyone who does.
blog.goeswhere.com
Faux' blog: 2015/04/
https://blog.goeswhere.com/2015/04
Fat data and Postgres. I spend a reasonable portion of my life dealing with Medium Data: columnar files (typically CSV) you can easily fit on disc, can easily fit in disc cache after compression, and can probably fit in RAM after some pre-processing; files slightly too big to open in modern versions of Excel, and far, far too big for LibreOffice Calc or Apple's Pages. Have a column limit around the 1,000-2,000 column mark. My first attempt was to insert it directly from psycopg2. Create table arrays as s...
blog.goeswhere.com
Faux' blog: 2014/02/
https://blog.goeswhere.com/2014/02
I was at Fosdem 2014. I experienced some things which may count as learning, or just exclamations. I'm not really sure how this kind of thing works. (editor's note: I guess, having written it, that the term is "braindump"). Reproducible Builds for Debian. 65% of the archive can be reproduced from binaries we have available. Is a great source of entropy; timestamps are a great source of source-control noise. Maybe we should make a. For people to use instead of. Is a nice bundle of. Scaling Dovecot: Nice o...
blog.goeswhere.com
Faux' blog: 2011/12/
https://blog.goeswhere.com/2011/12
I'm paranoid, but also poor. I use gitolite. To control access to my git repositories. Because github wanted $200/month. To meet half of my requirements, and wern't interested in negotiating (I tried). Like github, I have two types of git repositories. Public repositories; which show up on gitweb and git-daemon and etc., that everyone can access; and private repositories, which contain my bank details. My conf file consists of:. A set of user groups:. A set of repositories. Both public and private:.
blog.goeswhere.com
Faux' blog: 2014/03/
https://blog.goeswhere.com/2014/03
I've been trying to learn Plover. For a couple of weeks. It's an entertaining experience. Quite a few words still stump me and I end up going to the dictionary. I've built a Plover summary and hints chart. And a new mobile-compatible Plover dictionary look-up. This post was composed on Plover at a total speed of around four words per minute. It's rather hard going so far.
blog.goeswhere.com
Faux' blog: 2012/06/
https://blog.goeswhere.com/2012/06
In light of today's supposed LinkedIn breach, it seems like an appropriate time to finally write up my password policy. Many people have cottoned on to the idea that having the same password on different sites is a bad idea. There's various technical solutions to this, such as generating a site-specific. I, however, believe this scheme to be too inconvenient; they require you to always have access to the site or tool, and don't work well in public places. What we're really trying to do here is:. Come up ...
blog.goeswhere.com
Faux' blog: Fat data and Postgres
https://blog.goeswhere.com/2015/04/fat-postgres
Fat data and Postgres. I spend a reasonable portion of my life dealing with Medium Data: columnar files (typically CSV) you can easily fit on disc, can easily fit in disc cache after compression, and can probably fit in RAM after some pre-processing; files slightly too big to open in modern versions of Excel, and far, far too big for LibreOffice Calc or Apple's Pages. Have a column limit around the 1,000-2,000 column mark. My first attempt was to insert it directly from psycopg2. Create table arrays as s...
blog.goeswhere.com
Faux' blog: 2013/05/
https://blog.goeswhere.com/2013/05
Is a fork of the xqilla. Processor to. make it work better from the command line. XQuery is a language for searching and editing XML; a superset of XPath. For example, let's say you have some XML coming along a shell pipeline:. A b id=5 hello /b b id=7 world /b /a. And you want just the 5th b's text:. Printf ' a b id=5 hello /b b id=7 world /b /a ' xqilla -i /dev/stdin (echo "data(/ b[@id='5'])") hello. Groovy. Much safer than trying to do this kind of thing with regexes or cut or whatever. Even if there...
blog.goeswhere.com
Faux' blog: LXC, dnsmasq and nginx
https://blog.goeswhere.com/2015/04/lxc-dnsmasq-nginx
LXC, dnsmasq and nginx. I have started using lxc. Extensively. It's reasonably easy to set up lxc on modern Ubuntu. I would argue that it's easier than Docker), and, with it's ability to run as a limited user, is much more security friendly. It also gels much better with the way I think about virtualisation and services; I don't want to be locked-in to a vendor specific way of thinking or deploying. All my tools and my muscle memory already understand. It turned out that dnsmasq. Here, we can see. Dig -t...