Longest Document I’ve Ever Published
ith recent milestones reached and a change in publication strategy, I decided to unleash some easy-to-run code along with the 535 pages which document the code as well as experiments that have been run using this code over the past year or so. The document is not as organised as it ought to be, but I hereby make a lot of stuff available in fairly raw form. I made an HTML version to overcome file size issues that are associated with PDF, not to mention accessibility issues.
Surface Matching With Statistics and Geometry: Technical Report for 2011 (PDF – 535 pages, HTML – multi-page layout)
In summary, performance verging state-of-the-art is reached. In order to pile up and share the approach/work done thus far (before proceeding with the exploratory direction of diffusion/other), these are hastily-assembled notes that weigh 60 MB (over 500 pages for this document) which contain a lot of material that can prove useful some time along the way. Here is a previous post with some shorter documents. They are all being uploaded as HTML and PDF once they become coherent enough, soon to be indexed and made accessible through the research page and Web search as well.






Filed under:
AST year (or at the end of 2010 to be more accurate) I started posting a lot more often about my work in this blog. I resumed what I had done years beforehand, even if that meant a loss of focus for this personal blog. But then again, this blog hardly ever had any single point of focus, so after the upcoming release of one gigantic PDF I will resume posting updates about my research on a more regular basis. This ought to achieve a more transparent communication of ideas. It is also a departure from the disappointing pace of blogging (about a post per week as of late).
RARELY find the time or even the need to write about my research in this blog, either the Linux- and software-related research or my job where I study geometric methods and develop new techniques that nobody tried before. The reason for inaction regarding the latter is that I cumulatively prepare a large report (over 300 pages so far) which will be published in full in this Web site when it’s all polished, along with code. I posted
O, TODAY was a little dull. In the midst of boredom scripts were produced to turn a sequence of images into an animation of small size with an example of crop, border, and resize (ImageMagick required). Let’s go through this one step at a time. Put all the images from the sequence in a working directory.
HE FOLLOWING research notes have just been uploaded, hopefully to help others understand the work undertaken and what ideas are being tested (generalised to the extent of being applicable to any type of data, including biomedical). The 200+ pages long document which is more complete is still work in progress and it will be published at a later date. Thanks to the
INALLY, after nearly a week since taking and returning from a break, I received the server I have been waiting for. It arrived on Monday afternoon and today I finished configuring it, running alongside the triple-head workstation which I will use to do computer vision jobs and other development/admin work. The research requires a lot of screen space and sometimes the RAM/CPUload goes rather high, which helps not at all unless there is a second motherboard at hand (and a fallback machine.


t various stages throughout my career (I am 29 now), myself and others pondered starting a blog about Open Access, Open Data or open research (opening one’s lab, including data, methods, etc.), but since I already spend more time than I can afford advocating GNU/Linux, I ended up spending a lot of my energy fighting against software patents. This is one aspect among several involving the sharing and transparency of the sciences. There is of course also a dimension associated with copyrights and despite its importance I no longer have time to pursue the issue; many people already do so anyway. To put the core argument succinctly, nowadays when we have the Internet and we have a presentation layer such as the World Wide Web (for exchange of inter-connected rich media), we no longer depend on many analogue technologies and the notion of scarcity must cope and catch up with what’s available. To expect people to publish their findings only in paper form with pagination is to pretend that we are locked into legacy, which we should not. Computer Vision is considerably limited by paper. There are much better and faster ways of getting messages across, especially in this age of information overload. Moreover, travelling for presentation to a crowd (conference) is less necessary now that many people’s Internet connections permit video streaming at a good resolution. Romanticising over the nostalgia which is 20th century-esque research is no longer beneficial as it leads to inheriting inherent limitations. I was reminded of this in an IRC conversation last night. It was agreed upon that the theory about publishers exploiting academics to write, review and even edit entire publications for the publishers to profit from at the expense of those academics is something that needs to be stopped. Many academics these days have their own homepages and sometimes their blogs too. They can publish a lot of material there and let the quality/accuracy be determined by citation, e.g. something like the PageRank system which merely inherits the ideas of algorithms before its time (take away Google’s/Stanford’s patent while at it). The issue of course is that people cannot reference papers by Internet addresses, at least not by conventional means. The idea that papers should be accessible through libraries though is outdated as libraries too are going somewhat extinct and the speed of working there is inferior. Other than self pride and honour from peers, what incentive is there really to being heavily involved in the publication industry which benefits publishers and offers writers not so much exposure anymore? Fewer people seem to be searching journals; they use external search engines or Wikipedia instead (it’s multi-lingual). The debate becomes ever more relevant now that Aaron Schwartz is being hounded for just doing his job.