Wednesday, August 22, 2007

More Subversion Scheme Bindings (this time for mzscheme)

The mzscheme bindings for subversion should now mostly work. As is with any software project, there are probably quite a lot of bugs, if you find any I'd love to hear about them (e-mail holden@pigscanfly.ca ) or if you use the bindings and they do work for you I'd been even more happy to hear about that :).

You can check the latest version out of the scheme-bindings branch from http://svn.collab.net/repos/svn/branches/scheme-bindings/ . To build the mzscheme bindings just run "make swig-mzscm", it even produces a plt package (although theres a little bit of extra cruft in the package). You can get an idea of how to use them by looking at the test file (subversion/bindings/swig/mzscheme/tests/run_tests.scm) , along with looking at the exported symbols (subversion/bindings/swig/mzscheme/README).

Monday, August 13, 2007

Next Generation C++

Awhile back I wrote about a talk given on C++ to the University of Waterloo Computer Science Club. Permission has recently been obtained to post the video files, so you can now watch the talk yourself. Dr. Bjarne Stroustrup, creator of C++, discusses new features that will be present in the next version of C++ (tentatively C++09). He addresses many important questions form the audience, and gives his views on C++ compilers, including GCC and Visual C++.

read more | digg story

Sunday, August 05, 2007

Getting REXML to play nicely with the non-english world

Its a large world out there, and not everyone uses the same character set. REXML does its best, converting everything into UTF-8 so you don't have to worry. Unfortunately older versions of REXML (such as version 3.1.2.1 presently in Ubuntu) fail to correctly parse most XML feeds encoding specification. Fortunately, the latest version (3.1.7 as of this writing) has fixed the regular expression to properly match the encoding types. This is great for importing non UTF-8 documents, but it simply reads in UTF-8 documents without first cleaning them. Most of the encoding & decoding in REXML is done using Iconv, so with a small patch against version 3.1.7 (based on Secure UTF-8 Output in rails) we can make REXML strip out invalid UTF-8 characters.

Not only does this help our application, but also for any XML documents we produce it ensures that we actually are following the encoding, making the world a slightly better place.

Free Blog Counter