## Technology

December 21st, 2014

# Guido van Rossum agrees with me

When I wrote How ignorant am I, and how do I formally specify that in my code? I said:

For decades, computer programmers have argued with each other over the issue of strict data-type enforcement, versus dynamic data-type unenforcement. And after 16 years of computer programming, I have come down firmly in the middle: I like optional typing.

Clojure allows me to work dynamically, but to type hint when I want to, so I can get the performance benefits of type ...

December 20th, 2014

Interesting:

I “write” a Call Option when I sell someone the right, but not the obligation, to buy in the future an agreed quantity of something at an price that is fixed now. So, for a payment now, I agree to sell you 10,000 chocolate santas[1] at 56 pence each, at any time up to 10th December. You’re prepared to pay the premium because you want to know that you’ll have santas in your stores at a price you can ...

December 20th, 2014

# Which is better, MongoDB or Postgres?

I am very impressed with the new JSON features in Postgres 9.4. So can Postgres completely replace MongoDB? There are some grouping and aggregate functions that MongoDB has that Postgres does not have. More so, at the Mongo shell I have access to the entire Javascript language, whereas Postgres only gives me SQL plus some cool extensions. I recently found out about “WITH” statements in Postgres. From the point of view of SQL, this is the coolest thing ever. From ...

December 20th, 2014

# An outside perspective on SqlAlchemy

Interesting:

For starters, as you might expect from the difference in Ruby vs. Python cultures, SQLAlchemy is more explicit than ActiveRecord. SQLAlchemy has what is called a “declarative” mode where you define your class and your table all at once (within a class that inherits from SQLAlchemy’s declarative_base class). But you can also explicitly create, edit, and inspect an explicit metadata object which defines the table. Then you can define a Python class for your object – and then use SQLAlchemy’s ...

December 19th, 2014

# The big Python surprise: a culture of composable libraries, like Clojure

It is well known that Clojure has a culture best summarized as “Clojure developers prefer to assemble their own stack from small, composable libraries.”

I have been working with Python lately. I am surprised by that it also has a culture of using small libraries. There have been 2 attempts at monolithic frameworks in Python: Zope and Django (maybe Pylons, though it is done as separate libraries, sort of like Symfony2 in PHP). But for the most part, Python goes for ...

December 19th, 2014

# What happens if you introduce a new pattern in your software but never replace the old?

This is very good:

Each new design and technology choice never completely replaced the one that went before. The application has archaeological layers revealing it’s history and the different technological fashions taken up successively by Laurence, Bruce, Ina and Gordy. If you look along the Version 4 line, you can see that there are four different ways of doing the same thing scattered throughout the code base.

Each successive lead developer acted in good faith. They genuinely wanted to improve the application ...

December 18th, 2014

# How to install Postgres

By coincidence, I installed Postgres 9.3 yesterday and today they released 9.4 so I’m doing it all over again. This was a good setup guide.

Source

December 16th, 2014

# Is PostGreSQL text search good enough?

Some interesting commentary:

As much as I would like it to be ‘good enough’, Postgres’ full text search lacks the feature set to be a reasonable solution to most search use cases. Background: I wrote a product search engine for a company called iTrackr using Postgres full text search, and later wrote the product search for milo.com using Solr. I also designed a simple (but very effective) algorithm to boost search rankings based on historical user click data, by generalizing click data ...

December 15th, 2014

# Logging activity can be expensive

Scaling issues.

Source

December 15th, 2014

# dtrace: error on enabled probe ID 5 (ID 992: io:mach_kernel:buf_strategy:start): illegal operation in action #3 at DIF offset 0

If I run “iotop” I see this error:

I looked into it and found this on StackOverflow:

I’ve looked into it a bit deeper and it does seem that it’s MySQL causing these heavy pulsing writes. When a table has an entry added or updated, does it have to re-write the entire table to disk or could it append/modify existing data? We are using MyISAM tables. We are also using SELECT queries with ORDER BY and TEXT fields, which of course require ...

December 3rd, 2014

# Java gave me a choice that I didn’t need, and I made the wrong choice

Interesting story:

I made a bug once, and I need to tell you about it. So, in 2001, I wrote a reference library for JSON, in Java, and in it, I had this line

private int index

that created a variable called “index” which counted the number of characters in the JSON text that we were parsing, and it was used to produce an ...

November 30th, 2014

# Why use Java?

This article is called “Java For Everything” and it makes a good case for re-using the same language over and over again, but it does not make any case for Java.

This makes clear of using the language that you know best:

And finally, I went to write a simple program that put up a web interface. I considered using Python, but this would have required me to figure out how to serve pages from its library. I had already ...

November 30th, 2014

# Letting others write your Emacs config

This is an interesting attitude:

After a long time of maintaining my own configuration I took the decision a few years ago to outsource it to people who really care about this stuff. I started with the Emacs Prelude (http://batsov.com/prelude/) which introduced me to some great tools. I recently switched to Vim bindings (evil-mode) and a week ago switched to Spacemacs (https://github.com/syl20bnr/spacemacs) which integrates better with evil-mode . So far I’m really liking the Spacemacs approach. tl;dr: Just use one of the ...

November 29th, 2014

# peerinvalid The package mocha does not satisfy its siblings’ peerDependencies requirements!

Setting up a new complicated system. I got to the point where “make build” completed. But I run into trouble with “make run”.

I don’t know why this error exists:

npm ERR! peerinvalid The package mocha does not satisfy its siblings’ peerDependencies requirements! npm ERR! peerinvalid Peer karma-mocha@0.1.9 wants mocha@* npm ERR! peerinvalid Peer mocha-as-promised@2.0.0 wants mocha@>= 1.8.0 Source

November 29th, 2014

# nixd is an automation framework to build your application environment

I am not clear why this is better than running a VM but this is an interesting attempt to fix the problem of setting up complex systems on machine:

nixd is an automation framework to build your application environment for development and production. It sits between build tools and configuration management engines, and lets you write simple shell scripts with hooks that will reliably get you from pristine source to running code in one command, and make every attempt to ...

November 29th, 2014

# We need a unified abstraction for apps to discover each others formats

Myself and a friend exchanged some email regarding the article “The Future of Asynchronous IO in Python”. My friend wrote:

Seems like he wants zmq/nanomsg embedded inside Python. I am not really sure why.

I responded:

I think what he is suggesting is more interesting: a single, unified way of speaking to all protocols (MySql, MongoDB, Redis, Kafka, etc). When he says “protocol” I think he means “byte format”. He points out that each app generally has its own method of ...

November 27th, 2014

# Slingshot cleans up its “catch any exception or object” syntax

Personally, I am pleased to see Slingshot improve this, as it is has long been a source of confusion. Paul Stadig writes a long comment that is critical of this change, but I disagree with his assumptions, especially here:

“It is confusing to have a class selector that selects against the wrapped object instead of the wrapper.” In other words (catch RuntimeException e …) selects against the wrapper object that is thrown whereas (catch Object o …) selects against ...

November 27th, 2014

# Why use Github and also Google Groups for an open source project?

This seems to be a universal pattern, where an open source project has both a Github account and a Google Groups mailist, but is the mailist really needed? Why not just use the tools offered by Github (issues, the wiki, etc). I like that Monger introduced a Feature Request label on Github:

Source

November 27th, 2014

Assertions don’t seem to be used very often in Python, but they are used in Flask:

eugene-eeo started the conversation:

eugene-eeo commented 10 hours ago

I was browsing through the source and found it weird that asserts were used instead of normal exceptions.

untitaker replied:

Alright. Because Flask is using assert-statements exactly as usually recommended. The Python wiki explains the meaning of assertion errors:

Assertions are not a substitute for unit tests or system tests, but rather a complement. Because assertions are a ...

November 27th, 2014

# Can Gunicorn die if bash quits before forking is done?

A very intersting bug:

Sly010 does some epic digging to find out why Gunicorn sometimes dies on startup:

Did a bit more digging:

# relevant section from daemonize() in util.py

if os.fork(): os._exit(0) os.setsid() ...

# (added 3 prints) # 3 out of 5 times this prints “1″ then silence. No gunicorn. # There is no error message and nothing in the logs.

if os.fork(): print "1" os._exit(0) print "2" os.setsid() print "3" ... # (added an extra sleep) # This prints 1,2,3 (but I would ...

November 27th, 2014

# The effects of signal() in a multithreaded process are unspecified

I did not know this, and I find this very surprising:

Continuing reading the manual page, the very first note under the NOTES section is this:

The effects of signal() in a multithreaded process are unspecified.

This is called Unspecified Behavior. What it means is that standard does not say anything as for how the function should behave in a multi-threaded environment. Therefore, it may exhibit a different behavior on different systems including different versions of the same system, at discretion of ...

November 27th, 2014

# Why I don’t use classes or protocols in Clojure

This post on Github sums it up very well. For almost every situation where, in an object oriented language, I would reach for interfaces and classes, in Clojure I tend to just use multi-methods.

The key issue is polymorphism. All of us want to work in a language that gives us powerful forms of polymorphism. All of the major languages offer us some types of polymorphism, but let us ask, which gives us the most powerful and flexible forms? ...

November 27th, 2014

# Returning an object from eval() in Javascript

I did not know this:

eval parses its argument in statement context. You have to put parentheses around an object literal if you want eval to return an object:

> eval(‘{ foo: 123 }’) 123

> eval(‘({ foo: 123 })’) { foo: 123 }

Source

November 27th, 2014

# A basic skeleton for python decorators

This is a nice resource that I will almost certainly come back to:

### Example usage ### class PrintDebugDecorator(Decorator): def _call(self, *args, **kwargs): print 'calling', self._f, 'with', args, 'and', kwargs return self._f(*args, **kwargs) class ModuloDecorator(Decorator): def _init(self, n=None): self.__n = n if n else 42 def _call(self, *args, **kwargs): ...

November 27th, 2014

# Protecting against JSON attacks in Flask

Interesting debate. So where should Flask enforce safety for the handling of JSON?

dghubble commented 18 hours ago

My understanding is that the previous behavior (always converting the args to a dict) was sufficient to always prevent a top level json list, but blocked the legitimate use case of jsonify’ing an object for which one has written a custom JSONEncoder. This PR seems to enable that ability, while still protecting against top level json lists.

Whether json lists are considered a vulnerability Flask ...

November 27th, 2014

# Top level arrays in Javascript are dangerous: Register callback on Javascript Array setters

Interesting attack:

So now what happens if a clever hacker is embedding this to his website and social engineers a victim to visiting his site:

<script type=text/javascript> var captured = []; var oldArray = Array; function Array() { var obj = this, id = 0, capture = function(value) { obj.__defineSetter__(id++, capture); if (value) captured.push(value); }; capture(); } </script> <script type=text/javascript src=http://example.com/api/get_friends.json></script> <script type=text/javascript> Array = oldArray; // now we have all the data in the ...

November 27th, 2014

# Weird that under Python 3.4 runtime you can import from 2.7

Interesting:

Weird that under Python 3.4 runtime you can import from 2.7. That should not be possible for all those sanity reasons.

Because allowing that means:

Thanks. I might miss something obvious but when using gunicorn MYPROJECT.wsgi:application i get:

File “/virtualenv/lib/python3.4/site-packages/django/utils/timezone.py”, line 13, in import pytz File “/virtualenv/lib/python3.4/site-packages/pytz/__init__.py”, line 37, in from pytz.lazy import LazyDict, LazyList, LazySet File “/virtualenv/lib/python3.4/site-packages/pytz/lazy.py”, line 3, in from UserDict import DictMixin File “/usr/lib/python2.7/UserDict.py”, line ...

November 26th, 2014

# Prepare for broken builds

Funny:

Source

November 26th, 2014

# Every programming language has its own culture

It is interesting to note that this article about decorators in Python does not use the “word” closure, even though that would be the obvious way of talking about these, were we discussing a language like Javascript, where discussion of closures has been central to the culture since Doug Crockford first discussed them back in 2001.

So, decorators are an application of closures, i.e. closures applied to functions? Could you explain the ...

November 26th, 2014

# Ugly mutable parts of Python

I like Python more than PHP, but the mutable parts of Python are as ugly as PHP. This function is an exampe:

def transform_words(content, targets, transform): """Return a string based on *content* but with each occurrence of words in *targets* replaced with the result of applying *transform* to it.""" result = '' for word in content.split(): if word ...

November 24th, 2014

# Python and its discontents

Python started off as a clean procedural language, with some immutable objects that would have allowed it to evolve into a beautiful functional language, but instead, with version 3, it veered in the direction of classic object oriented programming. I disappointed by this and so I have some interest in the various disappointments expressed by others:

By far my biggest problem with the language is the stupid slot system. I do not mean the __slots__ but the internal type slots for ...

November 24th, 2014

# An abundance of file watchers in Clojure

This is a good overview of the file watchers in Clojure. I had no idea there were so many:

Typically file watchers are implemented using either one of two patterns:

verb based – (add-watch directory callback options)

noun based – (start (watcher callback options))

The difference is very subtle and really centers around the verb start. If the verb start does not exist, we can treat the file-watch as an action on the file object. However if it does exist, we can treat ...

November 24th, 2014

# Instant dashboard with the ELK stack?

I need to build some dashboards for a business. I am impressed with Riak but I would have to find some way to build the dashboard. In theory, the ELK stack gives an almost instant dashboard. I am not at all interested in Logstash, but the rest of this sounds interesting:

We’ve built a data analysis and dashboarding infrastructure for one of our clients over the past few weeks. They collect about 10 million data points a day. Yes, that’s ...

November 24th, 2014

# How to write errors to the terminal while booting a daemon

A very clever trick from Unicorn, which can probably be used everywhere:

And then there’s the ready_pipe Unicorn uses, which is actually quite an amazing trick. See, if you want to daemonize a process under Unix, you need to call fork(2) two times (and do some other things) so the process is completely detached from the controlling terminal and the shell thinks is the process is done and gives you a new prompt.

What Unicorn does when you tell it to run ...

November 24th, 2014

# Using Emacs Eshell as the ultimate shell

Very interesting idea, I am tempted to try it:

File Selection

If all you were doing was renaming a single file, or changing access permissions on all files in a directory, you’d hardly need a flexible shell, as dired or even Finder is sufficient for those tasks. A shell comes in handy when selecting a subset of files based on a pattern, and EShell really shines here, because of its filters.

$ls -al *.mp3(^U) # Show songs I don’t own If ... Read More Source November 23rd, 2014 No Comments # The Tracy-Widom distribution Interesting: The Tracy-Widom distribution is an asymmetrical statistical bump, steeper on the left side than the right. Suitably scaled, its summit sits at a telltale value: √2N, the square root of twice the number of variables in the systems that give rise to it and the exact transition point between stability and instability that May calculated for his model ecosystem. The transition point corresponded to a property of his matrix model called the “largest eigenvalue”: the greatest in a series of ... Read More Source November 23rd, 2014 No Comments # Setting up a daemon to run automatically at startup If I sudo to root and run: ls -alh /etc/rc2.d/ I see something like this: Apparently, on Ubuntu, the creation of these links is managed by update-rc.d DESCRIPTION update-rc.d updates the System V style init script links /etc/rcrunlevel.d/NNname whose target is the script /etc/init.d/name. These links are run ... Read More Source November 23rd, 2014 No Comments # Controlling the aggressiveness of swap Interesting: sysctl is similar ulimit: It allows to configure kernel parameters at runtime. If you wish to keep settings persistent across reboots you should edit /etc/sysctl.conf – be aware that wrong settings may break things in unforeseen ways. Code Listing 4: Exploring sysctl variables # sysctl -a … vm.swappiness = 60 … The list of variables is quite long (367 lines on my system), but I picked out vm.swappiness here. It controls how aggressive swapping will be, the higher it is (with a maximum of ... Read More Source November 23rd, 2014 No Comments # Escort for command line utilities in Ruby This looks awesome: A lot of the existing CLI making libraries delegate to OptionParser for actually parsing the option string, while OptionParser is nice it doesn’t allow things like specifying the same option multiple times (e.g. like CURL -H parameter) which I like and use quite often. Trollop handles this case nicely, so a Trollop-based CLI tool is superior. Also a lot of the other CLI libraries in an attempt to be extra terse and DRY make their syntax a little ... Read More Source November 23rd, 2014 No Comments # Login into remote server with one word alias I need to remind myself of this every time: The SSH config file is a set of key-value pairs. One type of setting is a Host keyword with other attributes. Thus, an appropriate Host entry might be:$ cat .ssh/config Host annoying HostName annoying.hostname.example.org User admin Port 22022 Now the previous examples become:

$ssh annoying …$ scp some-file annoying: … Finally, the last step is to take advantage of alias in your shell. In your .profile (or .bash_profile, etc):

June 24th, 2014

# Refactoring fat models

I had a job interview at Bookspan.com.

At first I talked to a guy named Tom. I assumed he was the leader of the tech team, so I talked about my wide experience. He seemed confused by my recent experience with Ruby. He asked if I was a serious PHP programmer? I said a few negative things about PHP, since most corporations are now pulling away from it. PHP is going out of fashion (as you can see in this ...

June 24th, 2014

# Eventual consistency is probably the only consistency that one can hope for using Javascript and WebSockets

On the one hand, I am very impressed with this article: Eventual Consistency in Real-time Web Apps. On the other hand, how can anyone keep up with what is best practice in the land of Javascript, when every week seems to bring a new framework or methodology?

Having said that, I’ll point out that there is no way to ensure a 1-to-1 match between one’s backend model and one’s front-end model, so all one can do is pick one to ...

June 24th, 2014

# PHP-FPM with Nginx

I already linked to this, but I will link again because, seriously, this is one of the best setup tutorials I’ve seen:

One of the greatest strengths of PHP-FPM is its ability to scale its worker processes up and down as load on the server increases. PHP-FPM can have several “pools” of PHP handlers: one for each different Web application, with different numbers of worker processes and different rules about when to add more processes or kill idle processes.

Our needs ...

June 24th, 2014

# Docker is the future

Something like Docker is probably the future. Clearly, virtual machines is becoming a popular way to manage dependencies. However, Docker still has problems:

Misconception: If I use Docker then I don’t need a configuration management (CM) tool!

This is partially true. You may not need the configuration management as much for your servers with Docker, but you absolutely need an orchestration tool in order to provision, deploy, and manage your servers with Docker running on them.

This is where a tool like ...

June 17th, 2014

# Emacs struggles to get a good package manager

Kind of pathetic that the world’s best text editor only got package-support in 2012:

GNU Emacs 24 (released in June 2012) introduced official support for packages, that is, a way of installing extensions from a remote repository. This was a huge step forward for Emacs, as it not only allowed users to easily find and install extensions, but it also made it possible for extensions to build upon other extensions without having to tell the user “great you want to install ...

June 3rd, 2014

# The strange way my memory works

I was at a job interview today and I said “Do you remember that essay that James Garret wrote in 2004, in which he coined the acronym ‘AJAX’ “?

Feeling uncertain about what I said, when I got home I decided to check my facts, and found that his name is actually “Jesse James Garrett“.

I find it strange that my memory would hang onto his middle name rather than his first name.

I am also surprised that the term “AJAX” ...

June 1st, 2014

# Avoid overcrowding your web server with too many unneeded processes

This is some very good advice (I’ve been thinking of using Arch Linux for future projects, as I understand it is an extremely minimalist Linux):

Avoid overcrowding your web server with too many unneeded processes. For example, if your server is purely for web serving, avoid running (or even installing) X-Windows on the machine. On Windows, avoid running Microsoft Find Fast (part of Office) and 3-dimensional screen savers that result in 100% CPU utilization.

Some of the programs that you can consider ...

June 1st, 2014

# Using Apache Flood to test websites

I only just learned about Flood. In the past I used ab, which is a severely limited tool, in that it only sends HEAD requests. What I have often wanted is a tool that was as simple as ab, but which could send parameters, make a GET request, and give me more feedback than CURL. The XML config for flood suggests it is nowhere as easy to use as ab, but it still looks fairly simple and it looks like ...

June 1st, 2014

# How does PHP work

I like this graph about PHP execution. The white boxes show how things worked as long ago as PHP4 and the gray boxes show all the new and optional stuff:

Source

June 1st, 2014

# PHP and Squid

This is interesting:

Perhaps the most significant change to PHP performance I have experienced since I first wrote this article is my use of Squid, a web accelerator that is able to take over the management of all static http files from Apache. You may be surprised to find that the overhead of using Apache to serve both dynamic PHP and static images, javascript, css, html is extremely high. From my experience, 40-50% of our Apache CPU utilisation is in ...

June 1st, 2014

# Threads in a Unix process

While doing research for the post I just wrote about Joe Watkins and threads in PHP, I came across this graphic, which I thought did a nice job of showing threads in a Unix process:

Source

June 1st, 2014

# Setup is hard, and it wastes an extraordinary amount of developer time

I was just looking at this article, PHP with PHP-FPM, and it brought back memories of Timeout.com. All of the developers who worked at Timeout eventually had to set up the company CMS, and this was something of a hazing process, in that every developer later remembered the experience with dread. The CMS was a massive Symfony project — without question, the largest PHP code base I have encountered in 14 years of working with PHP. I’ll give you a ...

May 31st, 2014

# Using Xdebug to profile PHP

Interesting overview of using Xdebug:

I’ve sorted the execution time of each call in order to determine which calls are the most expensive. The call to the Default_Model_Platform model’s hot() method ranks up towards the top, and because I know this data changes only every few hours, now I can safely cache it and thereby eliminate this expensive database query (which is indeed a fairly large JOIN operation). After implementing caching I again profile the page and indeed have eliminated that ...

May 31st, 2014

# Do not ever use MemCache

#2. Use a Profiler

#3. Tone Down Error Reporting

#4. Take Advantage of PHP’s Native Extensions

#5. Use a PHP Accelerator

#6. Avoid Expensive Operations Through Memory Caching

Missing from the list is “Ask yourself if PHP is the correct language for what you are trying to do.” I think of this especially in regard to the 6th tip, about memory caching. If you start using something like MemCache, then you ...

May 31st, 2014

Is there a facility to use thread-local storage?

and Joe Watkins replies:

The static scope of a class entry can be considered thread local, in a way. Complex members (objects and resources) are nullified when creating new threads, but simple members (arrays/strings/numbers/mixture of any of the above) are copied, so in the static scope can be class::$config which contains connection info to whatever and class::$connection can be the connection itself, ...

May 31st, 2014

# In PHP, the foreach() loop is a performance murderer

I didn’t know this:

Source

May 31st, 2014

# Building a recommendation engine for one’s customers

This is a great overview of building a prediction engine. Some of the math is over my head. Of these approaches, the only one I have the slightest familiarity with is cosine similarity.

Source

May 4th, 2014

# The further rise of polyglot programming

Interesting:

Almost every language shows a long-term downhill trend. With the exception of Java and (recently) CSS, all of these languages have been decreasing. This was a bit of a puzzler and made me wonder more about the fragmentation of languages over time, which I’ll explore later in this post as well as future posts. My initial guess is that users of languages below the top 12 are growing in share to counterbalance the decreases here. It’s also possible that ...

May 2nd, 2014

# Design by contract in Clojure

I posted a comment here:

I have gotten in the habit of doing 2 things: 1.) I used :pre and :post conditions as you are doing here 2.) I also use dire so when the :pre or :post conditions fail and an Assert exception is thrown, I can capture the arguments and the return value and write a meaningful error message:

https://github.com/MichaelDrogalis/dire

I do a lot of this: :post [(:discount %)] I also test for value ranges: :post [ (> (:totals %) 100) (< (:totals %) 1000) ] I am thinking I might ...

April 21st, 2014

# What is the correct way to handle errors when making HTTP calls?

I like this:

I wanted a simpler solution that:

treated exceptions as exceptions was general enough to leverage clj-http exception gave informative error messages in the right place A quick side note on exceptions vs happy paths everywhere. Some will say a bad response isn’t an exception, but is something to be expected. I agree. This is something that should be handled at the app level though, and not the library level – if I get a 401 as an end user I expect to ...

April 21st, 2014

# The problem with frameworks

This is funny:

Let’s pretend I’ve decided to build a spice rack.

I’ve done small woodworking projects before, and I think I have a pretty good idea of what I need: some wood and a few basic tools: a tape measure, a saw, a level, and a hammer.

If I were going to build a whole house, rather than just a spice rack, I’d still need a tape measure, a saw, a level, and a hammer (among other things).

So I go to ...

April 21st, 2014

Late 2012 architecture Many open source components Memcache, redis, MySQL, etc. Necessarily heterogeneous Organized around services Distinct responsibilities Isolated from each other Distributed computation and data RPC between systems Multiplexing HTTP frontend Crucial for modularity, load balancing

Programming the datacenter Concerns include Partial failures Deep memory hierarchies Split heaps Dynamic topologies Changes in variance, latency tails Heterogeneous components Operator errors Taming the resulting complexity is the central theme of our work.

Source

April 21st, 2014

WhatsApp was recently bought by Facebook, for $19 billion. This is the first time a startup had a big success, using Erlang as the basis for all of its technology. The most stable commercial computing device in history is a telephone switch developed at Ericson and built with Erlang. It can handle millions of simultaneous connections, it has 1.7 million lines of code, and it averages 1 hour of downtime every 20 years. The one language I would like to learn, other ... Read More Source April 21st, 2014 No Comments # Fast setup with Vagrant and Ruby 2.0 This is a good overview: How to do it: download and install Vagrant – I use version 1.2.7 You can check that with: vagrant -v You should see: Vagrant version 1.2.7 create a folder for your Rails application and go to it mkdir rorapp cd rorapp initialize your Vagrant machine vagrant init precise32 http://files.vagrantup.com/precise32.box This is your virtual machine (server) that will hold and run your Rails application. If you check now, Vagrant created a config file in your rorapp folder, called Vagrantfile. You’ll change that a little on next step. make ... Read More Source April 21st, 2014 No Comments # Formal proofs for software: prove small theorems, not grand ones Interesting: For “you can’t prove anything big enough to be useful!”, consider the Quark project: http://goto.ucsd.edu/quark/ showed you don’t need to prove a program of interesting size. You can defend millions of lines of buggy code with a “software firewall” made of formally verified code. Verify the right thousand lines of code that the rest needs to use to talk to anything else, and you have very strong security properties for the rest of the code. seL4 and CompCert are clearly also quite useful programs. … I don’t think the technology ... Read More Source April 18th, 2014 No Comments # Kill your process and restart it is still a popular hack in Ruby land I find it surprising that this is still accepted as best practice in the Ruby community: But it’s not all bad news. The Ruby core developers are aware of the problem, and there are some changes (3GenGC, oldgen space estimation) being tested which may bring relief. But right now, for users of Ruby 2.1, this is a very real problem that could easily affect you in production. What did we do? We used a combination of Unicorn::OobGC and unicorn-worker-killer to help tame ... Read More Source April 18th, 2014 No Comments # Vagrant is easy This page offers a simple explanation of how to start using Vagrant: Getting Vagrant started Before you can run vagrant, you’ll need to download and install a few things: The Vagrant tool – This is the actual vagrant tool itself. It manages virtual machines Virtualbox – Virtualbox is the virtual machine where your code will run If you have an app that already has a Vagrantfile in it, it’s very easy to get started. At the command line change into the folder with the ... Read More Source April 18th, 2014 No Comments # What is devops? A devop is like a sysadmin, so why do we need a new word? Because of the new emphasis on automation: A DevOps person isn’t someone who develops, and who does Ops. It’s someone who does only Ops, but through Development. The last time I looked for a senior sysadmin — less than a year ago — I didn’t get anyone who was comfortable programming in Perl/Python/Ruby until I started using the term DevOps. There are companies where the developers are ... Read More Source April 8th, 2014 No Comments # Forked processes, concurrency, and memory problems Last week I expressed my doubts about Unicorn (and the idea that it uses processes, therefore it uses Unix, therefore it must be good). Here is another article that looks at Unicorn, and in particular the memory consumption that goes along with forked processes: Unicorn uses forked processes to achieve concurrency. Since forked processes are essentially copies of each other, this means that the Rails application need not be thread safe. This is great because it is difficult to ensure that ... Read More Source April 7th, 2014 No Comments # Yahoo has some very stupid programmers Good lord, why is this developer at Yahoo so slow on the uptake? Thank you for your submission to Yahoo! Unfortunately we are unable to reproduce the bug due to insufficient information. Please provide us with a proof of concept or any other additional evidence required to reproduce the issue. ** The attacker would have to know the invitation id correct? One has the sense that the person reporting the bug is shocked by the lack of concern shown by Yahoo: d4d1a179c0f3 changed ... Read More Source April 7th, 2014 No Comments # The Clojure workflow still suffers and the REPL is not a cure all Stuff like this happens to me: Here is a scenario that you might recognize. You’ve done a pretty substantial refactor, including new dependencies in project.clj. You need to bounce the REPL. Knowing that this will take forever you immediately switch to Prismatic. 15 minutes later you look at your Emacs again where you notice that there is a syntax error so the REPL didn’t launch. You parse the impossibly long stack trace and fix the bug. cider-jack-in again and switch back ... Read More Source April 6th, 2014 No Comments # Behavior driven development is broken This is very good: If it takes you ten lines to communicate the idea of adding subpages, then you’ve wasted my time. I’m not alone in thinking this. BDD expert Elizabeth Keogh tells us: “If your scenario starts with ‘When the user enters ‘Smurf’ into ‘Search’ text box…’ then that’s far too low-level. However, even “When the user adds ‘Smurf’ to his basket, then goes to the checkout, then pays for the goods” is also too low-level. You’re looking for something ... Read More Source April 5th, 2014 No Comments # Why has Linux not seen more forks? Strong language, and strong opinions, as always, from Linus Torvalds. Now that I think about it, isn’t it amazing that Linux remains stable, even after all these years. I remember someone predicting, years ago, that Linux would split apart into a million useless forks, just like Unix did a long time ago. But that never happened. There are a lot of distros, but the kernel remains 100% under the control of Torvalds. That must mean people trust him. And ... Read More Source April 4th, 2014 No Comments # Using Gloss to change bytes into Clojure data structures Interesting: I started creating a very simple protocol to allow clients to connect via telnet. So it is: PUT LSA |* We have two main commands, PUT and LSA. For PUT, author is the guy speaking, via is who noted it, and the fact is the statement itself. And for LSA command, you can pass the author’s name and the system will return all the facts spoken by the author. * means you want to read all the facts. Any other command ... Read More Source April 4th, 2014 No Comments # Hashmaps versus btrees Interesting: Unsuprisingly, a hash map performs far and above the rest. This is to be expected, mapping is exactly what hash maps are for and, in most situations, they should perform insertions and lookups with amortized O(1) time complexity. However, for situations where you made wish to preserve order, a tree may be a better choice. For that, you can see that a well-tuned btree was outperforming a red/black tree by more than 2 times. As memory architectures begin to behave more ... Read More Source April 4th, 2014 No Comments # Just Libraries – the composition of small apps Clojure favors the composition of small apps. The Clojure community has shown a resistance to monolithic frameworks like Rails. Now Immutant is moving further down the small app road. For its second major release, Immutant will simply be a collection of libraries, one for each of the commodity services currently available to applications using an Immutant 1.x container: web, scheduling, messaging, caching, and transactions. These services will be provided by the following underlying components: Undertow, Quartz, HornetQ, Infinispan, and Narayana, ... Read More Source March 31st, 2014 No Comments # The worst web site ever: healthcare.gov What an incredible disaster. I say this as a professional who develops websites. Several states, such as Kentucky, built their own web sites, which have worked great. But the Federal site, even 6 months after launch, remains a disaster. This is the error message I got when I just now tried to sign up: Today’s the last day to sign up for Obamacare if you’re planning on using the healthcare.gov website. Unfortunately for people who tried to log onto the ... Read More Source March 31st, 2014 No Comments # Java 8 has an Optional to deal with NullPointerException I don’t think I am impressed with this. The idea is borrowed from Scala. I have no love for Java or Scala, and I only follow Java because it impacts Clojure. If this enables Clojure to do something clever with NullPointerException, then maybe I will reevaluate this. Source March 31st, 2014 No Comments # Content Security Policy and Ruby and Clojure Although I love Clojure, I must admit that Ruby and Rails have an impressive depth of gems to help with every aspect of web development, including security. John P Hackworth recently wrote of the weakness of the Clojure eco-system, although his criticism is also an attack on the whole of idea of “small libraries that compose well” which amounts to an attack on the idea of “small pieces, loosely joined”. Clearly, good security can be achieved with small libraries that ... Read More Source March 31st, 2014 No Comments # Using Clojure to build a microservices CMS Many of us become cynical about technologies that promise big breakthroughs in productivity, so we become overly careful in our choices, but this is a good question for managers to always be asking: “Why would a large organisation with a mix of technologies and legacy systems want to muddy the waters with a completely new language?” If you want to make the conservative choice, and stick with what you already have, you should be able to articulate the reasons as clearly as ... Read More Source March 30th, 2014 No Comments # eat food for food in foods when food isnt ‘chocolate’ Of the many attempts to re-invent Javascript, the mostly puzzling to me are those that do not fix any problems, and then invents some more. I realize there is a strong desire to borrow ideas from Ruby and bring them to Javascript, but where one can’t do that cleanly, one shouldn’t do it at all. It’s a tool, that is all. Ambiguous code is a poorly thought out contrived example with a simple solution. To me, this: eat food for ... Read More Source March 30th, 2014 No Comments # Chris Granger: more problems with object oriented programming At this point the evidence against object oriented programming seems overwhelming. I’ve linked to many articles here on this blog. Chris Granger offers another take on this issue: Programming is unobservable We can’t see how our programs execute. We can’t see how our changes affect our programs. And we can’t see how our programs are connected together. That basically means we can’t observe anything. The state of the art in observability is a stepwise debugger, which forces us to stop the world ... Read More Source March 30th, 2014 No Comments # NoSQL is a new of doing things, not a drop-in replacement for SQL I like this: Both NoSQL and Erlang had a burst of use and interest but because they were seen as silver bullets. Soon people realized you couldn’t simply translate your imperative code to Erlang and see improvements but instead regressions. Additionally, throwing your relational data at a NoSQL databases caused the same. I feel the NoSQL culture and programmers haven’t retracted to the core yet as much as Erlang. Though Erlang may see another surge of misuse and misinterpretation now with the ... Read More Source March 27th, 2014 No Comments # The advantages of Ruby on Unicorn This is an interesting way to look at things. Since so much of Ruby code is not thread safe, the fact that Unicorn spins up processes that don’t talk to each other is the most safe way to get concurrency in Ruby. That is a good point, though it is equivalent to saying “Since the code is broken, the the application server to do something weird to compensate for the brokenness.” Clearly, some people have good results with this, though ... Read More Source March 27th, 2014 No Comments # More negative views about Rails Rails lacks a story for concurrency. This is written by a Go programmer. Their criticisms are similar to mine, though for me the answer is “use Clojure” and so I end up doing JVM tuning, which is brought up as something scary to keep people away from jRuby. My impression is that the case against jRuby is weaker than the case against MRI Ruby (the C version). Rails is fundamentally – and catastrophically – slow. This well-known set of webapp ... Read More Source March 25th, 2014 No Comments # If Unix is good for Unicorn, why can’t Unicorn handle slow connections? I wrote about this recently, but I want to add to what I said. In what I now think of as a famous essay, Ryan Tomayko said “I like Unicorn because it’s Unix“. There must be something to this because the essay has been widely quoted, and I remember it, and I have re-read 3 times in the last 4 years. It had an impact. And yet, nothing in it convinced me to adopt that model. I rejected it and went ... Read More Source March 24th, 2014 No Comments # Photon could save PHP I have been extremely critical of PHP for the last 2 years. See “Why PHP is obsolete“. However, I just stumbled across Photon, which seems to address some of the core problems I see with PHP (especially the lack of any tools for dealing with concurrency): Why targeting Mongrel2? Mongrel2 is a very well designed, high performance server developed by pragmatic users who do not like bloated software. The use of ZeroMQ as the communication hub makes it extremely flexible while keeping ... Read More Source March 24th, 2014 No Comments # The tremendous innovation in Javascript There is no question that tremendous innovation is happening in the vast extended eco-system that touches upon Javascript. Sadly, I am not much interested in it. Maybe that is because I am not focused on the frontend right now. But also because I’m interested in solving these issues in other ways. All the same, Sam Ruby’s walkthrough of Angular.js is interesting: We have a model, view, and controller on the client, seemlessly interacting with the model, view, and controller on ... Read More Source March 24th, 2014 No Comments # Radical workarounds for the limits of MongoDB Whoa. This gives me interesting ideas: To reduce lock contention, we decided to run multiple MongoDB instances on one machine and create more granular databases in each instance. Basically data is stored in different instances based on its usage and in every MongoDB instance one database is created for each partner. Some people hate the fact that MongoDB forces you to do more in your own app, but I prefer designing with those constraints in mind. This has similarities to ... Read More Source March 24th, 2014 No Comments # Strange facts about HTML I feel like I’ve been away from HTML for awhile. 10 years ago I thought of myself as having some design skill, and I did a lot of front-end work, but in 2009, I moved to New York City and worked in some big companies with strict divisions of labor. I was a backender, and backenders are never frontenders. So I’ve been away from the frontend for awhile. It is slowly becoming foreign territory to me. I was surprised to ... Read More Source March 24th, 2014 No Comments # Why is the technology for blogs so difficult? The web is broken and can never be fixed. The only way forward is to start over again, to get rid of the Internet Protocol, the Transmission Control Protocol and the Hypertext protocol, and to invent a new protocol that fixes the problems with the old protocols. I am doubtful this can be achieved. Back in 2005, David Heinemeier Hansson offered a Rails tutorial showing how you could create a blog in 15 minutes : This was a world changing moment. ... Read More Source March 24th, 2014 No Comments # Once again, the shift to “smart services, dumb pipes” Yesterday I linked to the article over at Martin Fowler’s website where he wrote about the shift away from complex routing frameworks, towards a system of “smart services, dumb pipes”. Here is one more data point: At Digg our SOA consisted of many Python backend services communicating with each other as well as being used by our PHP frontend servers and Tornado API servers. They used Apache Thrift for defining the interfaces, clients and as the underlying protocol. …Coming off the Digg ... Read More Source March 24th, 2014 No Comments # Often businesses handle a degree of inconsistency in order to respond quickly to demand Perfect consistency is too rigid for most businesses, and it is painful when technical teams try to enforce this on a company, out of some ideological commitment to doing things the “correct” computer science way. “Eventual consistency” has been the standard that businesses have striven after since the Arab-Hindu cultures first invented dual-entry accounting, more than 500 years ago, and this is the standard that tech teams should enable for the businesses they serve. Choosing to manage inconsistencies in this ... Read More Source March 24th, 2014 No Comments # What kind of standards are useful to your team? I love this: Its a bit of a diochotomy that microservice teams tend to eschew the kind of rigid enforced standards laid down by enterprise architecture groups but will happily use and even evangelise the use of open standards such as HTTP, ATOM and other microformats. The key difference is how the standards are developed and how they are enforced. Standards managed by groups such as the IETF only become standards when there are several live implementations of them in the wider ... Read More Source March 24th, 2014 No Comments # The conceptual model of the world will differ between systems There is nothing wrong or bad about this, but rather, this is healthy: Decentralization of data management presents in a number of different ways. At the most abstract level, it means that the conceptual model of the world will differ between systems. This is a common issue when integrating across a large enterprise, the sales view of a customer will differ from the support view. Some things that are called customers in the sales view may not appear at all in ... Read More Source March 23rd, 2014 No Comments # A complexity that is frankly breathtaking How can anyone possibly think this is a good idea? To quote James Lewis and Martin Fowler: Certainly, many of the techniques in use in the microservice community have grown from the experiences of developers integrating services in large organisations. The Tolerant Reader pattern is an example of this. Efforts to use the web have contributed, using simple protocols is another approach derived from these experiences – a reaction away from central standards that have reached a complexity that is, frankly, ... Read More Source March 23rd, 2014 No Comments # The pushback against the monolithic framework I am pleased to think that others are as ready as I am to abandon the concept of the monolithic framework: Monolithic applications can be successful, but increasingly people are feeling frustrations with them – especially as more applications are being deployed to the cloud . Change cycles are tied together – a change made to a small part of the application, requires the entire monolith to be rebuilt and deployed. Over time it’s often hard to keep a good modular ... Read More Source March 23rd, 2014 No Comments # Leave the error checking in your code I leave the asserts in my Clojure code. I see a similarity of spirit expressed in the sentiment of James Hague (I especially like the use of the word “reckless”): That error checking is great during development was not controversial, but opinions after that were divided. One side believed it wasteful to keep all that byte and cycle eating around when you knew it wasn’t needed. The other group claimed you could never guarantee an absence of bugs, and wouldn’t ... Read More Source March 23rd, 2014 No Comments # A defense of MongoDB I posted this on Hacker News and now re-post it here. MongoDB offers the greatest benefit to those who have an evolving concept of their schema, and that tends to be startups, though I have worked in large firms that entirely re-invented their schemas. I worry that I would seem tedious if I listed the places that I have worked, and yet, on Hacker News, when I speak in abstract terms, I tend to get downvoted, so I will name a ... Read More Source March 21st, 2014 No Comments # Announcing Humorus-MG I just released Humorus-MG. This is an admin CMS for managing a collection in MongoDB. The app is written in Clojure. The README contains an unintentional mini-manifesto of what I believe about creating web software. This part in particular comes close to summarizing the kind of software that I would like to create this year: —————- Things that will never change about this app 1.) This app will never have more than 2,000 lines of Clojure code. None of my apps will ... Read More Source March 20th, 2014 No Comments # Why I use MongoDB I posted this on HackerNews. I am in agreement with what Jun Xu wrote. I think this is true: “For a technology startup with limited resources, broadly adopting a new DBMS means betting its own future on the DBMS. ” It has become popular to attack MongoDB, but I think it is difficult to get an objective view of what people are doing with it. If you want to read a really scathing attack on MongoDB, consider this post: http://www.sarahmei.com/blog/2013/11/11/why-you-should-never-use-mongodb/ But I recall reading that ... Read More Source March 9th, 2014 No Comments # Criticker gives away all of its users passwords Criticker gives away all of its users passwords in plain text. Of course the site is written in PHP. While you can make mistakes in any language, this kind of laziness is what you expect in PHP. Every request contains the secret key in the url. So all I need to do is capture a single request sent by the app and I have the key. Easy. My theory was that I’d get the list of users that the app had ... Read More Source March 8th, 2014 No Comments # Working with images using Clojure I am intrigued by Mike Anderson’s “imagez” library: Source March 8th, 2014 No Comments # How to monitor Clojure apps? Interesting: Powerful stream primitives (where (or (service #”^api”) (service #”^app”)) (where (tagged “exception”) (rollup 5 3600 (email “dev@foo.com”)) (else (changed-state (email “ops@foo.com”))))) Riemann streams are just functions which accept an event. Events are just structs with some common fields like :host and :service You can use dozens of ... Read More Source March 8th, 2014 No Comments # How to bankrupt a successful software company Interesting: Quark 5 and OS9 was what we were used to, but it was pretty miserable. The things that stick out: Restarting your computer and losing your unsaved work over software freezes was a regular part of your day. Like, many times a day. We had all these crazy workarounds to achieve certain effects like drop shadows or change-and-repeat. It was all pretty rudimentary and hard to standardize across many designers in a department. Shapes were pretty much a non-issue, so we had to ... Read More Source March 7th, 2014 No Comments # The downside of Unit Testing Interesting: I’m back in Java-land these days, which is culturally very pro-unit testing. After getting exposed to it again for a few months again I’ve come to side with the author here. I’ve never really been comfortable with the amount of time certain people dedicate to unit testing, especially the TDD crowd, but in my hiatus something has arisen in popularity which has made it all the worse: mockito. Prior to mockito, unit testing was (more or less) limited to testing that ... Read More Source March 4th, 2014 No Comments # Darren Holloway walks through the philosophy of Ring/Clojure Darren Holloway has written a post that should be added to the wiki on Github where Ring is hosted. He covers all the stuff that had me the most confused when I started doing web development with Clojure. He offers easy examples in pseudo-code to get the basic ideas across. I wish every project on Github had an introductory tutorial written in this style. An excerpt: Ring Conceptually Technically, Ring isn’t a framework or an application, but rather a specification ... Read More Source March 4th, 2014 No Comments # What it is like to think you are talented when you are ignorant Despite the “worst practices” approach, the thing worked. I like this story very much. My own story is a bit different, circa 2000-2005 I built a CMS out of PHP, and I did eventually find good ways to structure it, and I remain an opponent of “object oriented programming”. But other than that, a lot of this story overlaps my own. Despite what I now refer to as my “worst practices” approach, the thing worked. Every bad tutorial, every anti-PHP ... Read More Source March 3rd, 2014 No Comments # Emotional intelligence and success with Bitcoin Or rather, maladaptive ways to deal with stress: After Mt. Gox was hacked for the first time in summer of 2011, a friend asked Powell to help out, and soon, the San Francisco entrepreneur found himself on a plane to Tokyo. After landing, he rushed to Shibuya station, where he was met by his friend, Roger Ver, one of the world’s biggest bitcoin supporters who just happened to live across the street from Mt. Gox. Without bothering to drop off ... Read More Source March 3rd, 2014 No Comments # The difference between database indexes and database histograms Several things occurred to me when I read this, some of them off-topic, including my use of MongoDb, and how I have been unthinkingly re-creating histograms without even giving them that name. I do not regard that as a problem with MongoDb, it gives flexibility by doing very little itself, everyone using it is hopefully aware of the need to re-create database functionality within one’s own app. Then I asked myself the question: how does Oracle estimate that there are ... Read More Source March 3rd, 2014 No Comments # I still don’t get PAAS This is the problem for me: In my ideal world, deploying my apps wouldn’t require any platform-specific code, or if it did, that code would be portable between platforms. If I have to be aware of my servers, at all, then I’m still doing sysadmin, and if I have to do sysadmin, I want all the tools of sysadmin. I don’t want to do sysadmin on a crippled account that limits my options. Maybe someday there will be a real PAAS such ... Read More Source March 1st, 2014 No Comments # What is correct HTML syntax? Matias Meno of Colorglare asks the question “TO CLOSE OR NOT TO CLOSE?” This is from Ian Hickson in 2006, regarding the emergence of HTML5: Regarding your original suggestion: based on the arguments presented by the various people taking part in this discussion, I’ve now updated the specification to allow “/” characters at the end of void elements. To which Sam Ruby responded: This is big. PHP’s nl2br function is now HTML5 compliant. WordPress won’t have to completely convert to HTML4 before people who ... Read More Source February 9th, 2014 No Comments # Bloated software promises a stability which might be a liability Interesting: IT organizations are facing accelerating pressure to support companies’ growing need for business agility, innovation, customer responsiveness, and adaptability. This pressure doesn’t stop with so-called systems of engagement. It goes all the way back to systems of record. In fact, the distinction between the two is starting to erode. Enterprises are responding to this pressure by upgrading application architectures within and around the system-of-record tier. They are starting to view the “stability” of their legacy applications as a liability rather ... Read More Source February 9th, 2014 No Comments # Drupal is bloated software Stuff like Drupal offers ease of use for standard operations, and yet, when I work with clients, I find they have very few “standard” operations. Everything needs to be customized, and that is where Drupal becomes difficult: Drupal, much like many other CMSs, follows a development methodology that I call reverse development. It is the simple idea that the most fundamental moving parts of the technology have been already built for you, or are modifiable using a trivial UI, and ... Read More Source February 9th, 2014 No Comments # What is a Spruce Goose software project? I have worked on software that was just like this: Now the Spruce Goose was trying to fly. People couldn’t take their eyes off it. Who could comprehend its size! Three hundred-foot wingspan, seven-story tail, 200 tons of plane with room for 700 soldiers. It upstaged even the ocean liners lounging nearby. There it was,$25 million worth of prototype seaplane, skating along toward take-off, engines cackling and fuming. Howard Hughes, America’s most publicized aviator, designed it, swore by it, ...

February 9th, 2014

# Getting useful info out of git logs

Interesting:

git churn It’s possible to get valuable insight from history of a project not only by viewing individual commits, but by analyzing sets of changes as a whole. For instance, git-churn is a simple but valuable script that wraps git log to compile stats about which files change the most. For example, to see where the development of an app was focused on in the past 6 months:

$git churn –since=’6 months ago’ app/ | tail Incidentally, such analysis also highlights ... Read More Source February 7th, 2014 No Comments # The most difficult thing about programming Everything is working, then it breaks, and you never know why. The technologies we rely on are too numerous and complicated for us to understand them all — this is disempowering. I find this the most difficult thing about being a programmer: the surprise that even we can not understand all the technologies we use. When I’d finished this article, I wanted to spell check the content. emacs-ispell mode decided to go on strike. It could not find aspell, the ... Read More Source February 5th, 2014 No Comments # Trying to use the Formative library This is an interesting problem. I have records in a MongoDb database. Every record has a “created-at” field, stored as a datetime. This is a screenshot of the value: When I load this value via into a form via Formative, all the values fill in except for the year. Check out the screenshot: I wonder if this is a bug or if I have to reformat the datetime? Source February 5th, 2014 No Comments # Working with Clojure and Dire Here is a puzzle. I had a function “fetch” in namespace “controller” and it calls a function in namespace “secretary”. The secretary stores documents away, and then later gets them back, or it might return nothing, if there is nothing to return. I have a query “paginate-items” which on some pages correctly returns nothing, because on some pages there is nothing to paginate. But this function has been giving me problems. I decided to split my “fetch” function into a ... Read More Source February 4th, 2014 No Comments # Git is complicated because there are 5 places content can go Interesting: Why do these Places Exist? These places are part of the power and flexibility of Git over other systems. With an index, we can choose how to commit a multitude of changes, without having to commit to all our changes at once. This is a great flexibility. With the local repository versus the remote repository, we turn Git into a distributed version control system, where each repository is conceptually just as equal to the next one. With stash, ... Read More Source February 3rd, 2014 No Comments # Lisp as a mathematical formalism for recursive functions Very interesting: One of my favorite classes in school was 15-453: Formal Languages, Automata, and Computation, which used Sipser’s Introduction to the Theory of Computation: One aspect that I really enjoyed was that there was a narrative; we started with Finite State Automata (FSA), analyzed the additional power of Pushdown Automata (PDA), and saw it culminate in Turing Machines (TM). Each of these models look very similar and have a natural connection: they are each just state machines with different types ... Read More Source January 28th, 2014 No Comments # Mistakes that startups make This reminds of the change between Symfony 1.x and 2.0: The first mistake was made about 3 years ago, when we started building a new product, LiveRebel. When making the choice of technology we decided on the new web application framework Play, which fitted most of our development requirements very very well. Of course 6 months later Play committers made a decision to completely break backwards-compatibility for Play 2. So we were stuck with a technology that had an uncertain ... Read More Source January 23rd, 2014 No Comments # Why PHP is obsolete I was a huge fan of PHP back in the year 2000. I started using it right before the official release of 4.0 in April of that year. At that time, if you wanted to build a website, the main alternatives were C, Java, Perl and ASP. My criticism of those 4 were: 1.) C — too complicated for a fast changing website. Compiling an app took time, compiling was slower back then, there were fewer Open Source tools, and ... Read More Source January 23rd, 2014 No Comments # A language non-programmers praise while getting drinks at a bar Interesting: When out with friends recently, one of them mentioned how awesome Julia is. I was surprised to hear someone talk about it, even from another person in science. She turned and gushed about how awesome it was, how supportive the community was, even though she was “not really someone who likes programming.” And she liked it so much she was telling her friends about it at a bar! If you make a programming language that people who don’t like programming love ... Read More Source January 20th, 2014 No Comments # No one reads anyone else’s code for fun Interesting: First, when I did my book of interviews with programmers, Coders at Work, I asked pretty much everyone about code reading. And while most of them said it was important and that programmers should do it more, when I asked them about what code they had read recently, very few of them had any great answers. Some of them had done some serious code reading as young hackers but almost no one seemed to have a regular habit of reading ... Read More Source January 17th, 2014 No Comments # The argument against type hinting in Clojure Intersting: But it doesn’t hurt, does it? So one might think: “It doesn’t help, but it doesn’t hurt either, so I sprinkle hints all over the place to give myself some info on what this function returns.” And indeed this thinking is wrong. By now it should be clear that type hints are a low-level construct. Using them in the above mentioned way over specifies the types the functions take and return. You basically lock the code which could in theory be ... Read More Source January 17th, 2014 No Comments # Aliases for namespace qualified keywords in Clojure Interesting: However what happens if two libraries modify the same map? On the same key? Then we are in trouble, because one overwrites the other. To remedy this issue, Clojure provides (similar to Symbols) namespace-qualified keywords. Simply add a double-colon in front and the keyword will be qualified with the current namespace. user=> ::foo :user/foo Using the fully-qualified form, we can create keywords with an arbitrary namespace. user=> :my.company.division.project.module-a/keyword :my.company.division.project.module-a/keyword So we can communicate with a library using its own qualified keywords. The library can then ... Read More Source January 17th, 2014 No Comments # Why are forums so badly designed? This is ugly. I wonder why, since the 90s, forums have always had such awful graphic design? Source January 14th, 2014 No Comments # The awful magic syntax of Scala Interesting: Magic Syntax Over the years I’ve found that the easiest programming languages to get started with are those with the ‘simplest’ syntax. Simplicity is a hard thing to define, but in this case it is not the same as flexible. It’s actually closer to regular, or limited with extension options. I’ve found Lisp, Ruby and HyperTalk to be great examples. I’ve found AppleScript and Perl to be terrible. Scala is easily the worst I’ve seen. That punctuation-strewn build system I was forced ... Read More Source January 12th, 2014 No Comments # Your app is not real time if anything blocks Interesting: (My background: I work for Google, I did a real-time web prototype using the client libraries for GChat back in 2009 when real-time search was all the rage, my Noogler mentor at Google was the frontend tech lead for the eventual real-time search product we launched, and before Google I’d worked in financial software, where real-time responsiveness really is required.) I think that the folks currently building prototypes in Meteor dramatically underestimate the difficulty of scaling up real-time software to production-grade ... Read More Source January 11th, 2014 No Comments # Broken ad technology See screenshot. I was served the same ad 3 times on 1 page. This has to be a broken ad technology, yes? This cannot be deliberate: that would be idiotic. Source January 8th, 2014 No Comments # Functional programs are much more readily appreciated as mathematical objects Interesting: I write to you because of a rumor of efforts to replace the introductory programming course of our undergraduate curriculum the functional programming language Haskell by the imperative language Java, and because I think that in this case the Budget Council has to take responsibility lest the decision be taken at the wrong level. You see, it is no minor matter. Colleagues from outside the state (still!) often wonder how I can survive in a place like Austin, Texas, automatically assuming ... Read More Source January 6th, 2014 No Comments # Deploying Clojure to production This is a very interesting architecture for Clojure: And this is an interesting bit of deploy info: #!/bin/bash set -u # Treat unset variables as an error set -e # Exit immediately if a simple command exits with a non-zero status APP_HOME=/var/www/rssminer cd$APP_HOME

LOG_DIR=/var/log/rssminer sudo mkdir -p $LOG_DIR && sudo chown deploy$LOG_DIR

PER_MONGTH=date +”%Y-%m” # year-month- SHARED=”${LOG_DIR}/${PER_MONGTH}”

nohup java -server -Dsun.jnu.encoding=UTF-8 ...

January 5th, 2014

# What are the advantages of functional programming?

On Hackers News someone asked if those of us working with Clojure missed Object Oriented programming (OO). I responded:

Longer answer: an argument can be made that OO is a proven failure, and that there exists 40 years of evidence that OO is a failure.

I will start by talking about the general case, and then I will narrow my remarks to the web, since that is what you are asking about.

The least charitable way to describe OO is this: ...

January 4th, 2014

# The end of the HTML/CSS guru

I am actually surprised this lasted so long. I switched to CSS in 2000. But apparently what mattered was not the practitioners, but the browsers themselves: once they unified on standards, the need for specialists disappeared.

The reason the Web Standards Movement mattered was that the browsers sucked. The stated goal of the Movement was to get browser makers on board with web standards such that all of our jobs as developers would be easier.

What we may not have realized ...

January 3rd, 2014

# Where Google fails at search

Martins Programming Blog says there is a post on Hacker News with the title “Why is Clojure so slow?” So I run this search on Hacker News:

“Why is Clojure so slow” site:news.ycombinator.com

I get 9 pages of results, and none of them point me to the article.

Source

January 2nd, 2014

# Why git won as a distributed version control system

Interesting:

The third problem was Bazaars stance on completely immutable history. Software developers aren’t historians or lawyers. A lot of users were keen to keep their history simple. They cared about tracking what changes had happened to their trunk and having that sequence of changes be legible, not really about the individual commits and mistakes that led up to the patch that landed on trunk. Git offered “rebase”, which basically reapplies local changes on top of the upstream tree. Git is ...

December 31st, 2013

# Kill processes running on a certain port

This is clever:

searchAndDestroy() { lsof -i TCP:$1 | grep LISTEN | awk ‘{print$2}’ | xargs kill -9 echo “Port” \$1 “found and killed.” }

Source

December 29th, 2013

# How to listen for the delivery of a promise in Clojure

The problem:

add-watch was alpha in Clojure 1.2:

it is still alpha now, in Clojure 1.5:

I am curious what the plan is for add-watch?

http://nurkiewicz.blogspot.com/2013/03/promises-and-futures-in-clojure.html

and this struck me:

And here is where the greatest disappointment arrives: neither future nor promise in Clojure supports listening for completion/failure asynchronously. … As much as I love Clojure concurrency primitives like STM and agents, futures feel a bit underdeveloped. Lack of event-driven, asynchronous callbacks that are invoked whenever futures ...

December 29th, 2013

# The lack of future listeners in Clojure

Interesting:

Futures behave pretty much the same way in Clojure from user perspective – they are containers for a single value (of course it can be a map or list – but it should be immutable) and trying to dereference future before it is resolved blocks. Also just like promises, futures can only be resolved once and dereferencing resolved future has immediate effect. The difference between the two is semantic, not technical. Future represents background computation, typically in a thread ...

December 28th, 2013

# How to reload an app in the REPL with Clojure

I have been working with Clojure now for much of the last 18 months, so I am no longer a complete noob. But I was slow to get used to working at the REPL. Working with the REPL is taken for granted in the Clojure community, so much so it is tough to find step-by-step instructions for doing it. For me, a confusing issue for a long while was, if I wanted to get back into an app, what are ...

December 27th, 2013

# Clojure is concise

So now I have a working, multi-threaded CMS, written in Clojure, and it has less than 1400 lines of code. This is a beautiful language.

Source

December 25th, 2013

# Dependency Injection is more important than dependency injection frameworks

Interesting:

I was one of the earliest adopter of Spring in Norway. We developed a large system where we eventually had to start thinking about things like different mechanisms for reuse of XML configuration. Eventually, this evolved into the @Autowire and component-scan which took away the problem with huge configuration files, but in return reduced the ability to reason about the whole source code – instead isolating developers in a very small island in the application.

The applications tended to blossom in ...

December 25th, 2013

# The rise of the algorithm economy

Interesting:

They seem to agree on one thing: from a workaday perspective, math is essentially useless. Lisp programmers (we are told) should be thankful that mathematics was used to work out the Lambda Calculus, but today mathematics is more a form of personal enlightenment than a tool for getting anything done.

This view is mistaken. It has prevailed because it is possible to be a productive and well-compensated programmer — even a first-rate hacker — without any knowledge of science or ...

December 25th, 2013

# Why Scala should be outlawed

Wow:

/** * Join 22 futures. The returned future is complete when all * underlying futures complete. It fails immediately if any of them * do. */ def join[A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V](a: Future[A],b: Future[B],c: Future[C],d: Future[D],e: Future[E],f: Future[F],g: Future[G],h: Future[H],i: Future[I],j: Future[J],k: Future[K],l: Future[L],m: Future[M],n: Future[N],o: Future[O],p: Future[P],q: Future[Q],r: Future[R],s: Future[S],t: Future[T],u: Future[U],v: Future[V]): Future[(A,B,C,D,E,F,G,H,I,J,K,L,M,N,O,P,Q,R,S,T,U,V)] = join(Seq(a,b,c,d,e,f,g,h,i,j,k,l,m,n,o,p,q,r,s,t,u,v)) map { _ => (Await.result(a),Await.result(b),Await.result(c),Await.result(d),Await.result(e),Await.result(f),Await.result(g),Await.result(h),Await.result(i),Await.result(j),Await.result(k),Await.result(l),Await.result(m),Await.result(n),Await.result(o),Await.result(p),Await.result(q),Await.result(r),Await.result(s),Await.result(t),Await.result(u),Await.result(v)) }

Source

December 25th, 2013

# When programmers say “types” what do they mean?

Interesting:

What To Know Before Debating Type Systems I would be willing to place a bet that most computer programmers have, on multiple occasions, expressed an opinion about the desirability of certain kinds of type systems in programming languages. Contrary to popular conception, that’s a great thing! Programmers who care about their tools are the same programmers who care about their work, so I hope the debate rages on.

There are a few common misconceptions, though, that confuse these discussions. This article ...

December 25th, 2013

# Music is training for concurrent programming

I suspect music is the best training for concurrent programming. This is interesting:

Rich Hickey Q&A by Michael Fogus Best known as the inventor of Clojure, a Lisp that runs on the Java Virtual Machine and the first new member of the Lisp family to attract any widespread interest since Scheme and Common Lisp, Rich Hickey has been a software developer and consultant for two decades. Prior to starting work on Clojure, he made four attempts to combine Lisp with either Java or ...

December 25th, 2013

# What I learned from Clojure

Someone on Hacker News asked “Why use Functional Programming”. I wrote this in response:

I don’t have an answer for you, but for the last year I have been learning Clojure, and I have found it fascinating. I do not think this will satisfy you, but I would like to share some of the things that I find interesting about Clojure.

Lately I’ve been studying the source code of Aleph, a web server written by Zach Tellman. Zach is a very ...

December 24th, 2013

# This is why Rails sucks

50 minutes of Yehuda Katz talking about Rails. He says he wants to answer the accusation that there is too much bulk in Rails. He argues that much of that bulk is doing important stuff that you don’t know about, and that if they do their jobs intelligently, as designers of a framework, you should never have to think about all the cool stuff that the code is doing for you, but that you should be grateful that all that ...

December 21st, 2013

# How to organize namespaces in Clojure?

This is good:

You can think of namespaces as a tool to express something about your application. Here are some ideas to get you started:

Group functions into namespaces based on type of data they manipulate. For example, functions to manipulate customer data go in the “customer” namespace. This technique is familiar from object-oriented languages, but it has the same limitations: where do you put functions concerning relationships among two or more types? The OO answer would be to make a new ...

December 19th, 2013

# Contract programming in Clojure

I’ve been drifting toward a contract style in my Clojure code. Among other things I’m discovering along the way, I’ve been rethinking the need to be clear about arguments to function. This used to seem like the Right Way:

December 17th, 2013

# The marriage of math and computers

Amazing:

On last Thursday at the Heidelberg Laureate Forum, Vladimir Voevodsky gave perhaps the most revolutionary scientific talk I’ve ever heard. I doubt if it generated much buzz among the young scientists in advance, though, because it had the inscrutable title “Univalent Foundations of Mathematics,” and the abstract contained sentences like this one: “Set-theoretic approach to foundations of mathematics work well until one starts to think about categories since categories cannot be properly considered as sets with structures due to ...

December 14th, 2013

# Can we avoid firewalls by re-using obsolete ports?

Port numbers under 1000 tend to be associated with essential network services, so these are the least likely ports to be restricted by firewalls, yes? I am looking through the list of unix network socket ports on Wikipedia.

I am intrigued by the number of lower port numbers that seem to be available because their original official use has gone obsolete.

For instance: http://en.wikipedia.org/wiki/Xerox_Network_Systems

had ports 52, 54 and 58. Current status:

“Last used by Xerox for communication with the DocuTech 135 Publishing System, ...

December 14th, 2013

# When will media companies understand technology?

The big media event today is Beyonce’s new video. So of course, the server is down? The error is a WordPress error. When do media people learn how to use software and servers to handle big spikes in traffic?

Source

December 13th, 2013

# The heyday of unix

Usually when something is at its peak its victory seems so natural that no one really appreciates it in the moment. It is worth noting that Microsoft lost momentum with developers at some point in the period 2000 – 2005, and this has lead to the heyday of Unix, mostly Linux. All of the competitors are dead. I am not sure what comes after Unix, but it is noteworthy how much it dominates the landscape for applications that do not ...

December 9th, 2013

# Find all field names in MongoDb

I am switching from PHP/Mysql to Clojure/MongoDb. In Clojure, hyphens in names are more idiomatic than underscores. So I wanted to find all the field names, so I could see where the underscores were. Having done a straight import of all of my database tables into a single MongoDb collection, I ran this Javascript at the Mongo shell to get all the field names.

var arrayOfFieldNames = [];

var items = db.tma.find();

while(items.hasNext()) { item = items.next(); for(var index in ...