Fall is here in NW Ohio/SE Michigan, even though baseball season hasn’t quite ended yet. And along with the change in seasons, we get falling leaves, frosty mornings, and barn spiders. (If you’ve read or seen Charlotte’s Web, the titular character is a barn spider although much more personable than the real thing).
I got this idea from Scott Kelby while attending his Shoot Like a Pro tour. After trying it for a week or so now, I’m thrilled with the results (and mad that I didn’t do it sooner). If you use LightRoom or Aperture, and your camera supports it, you can shoot images and have them show up on your computer screen within a second or two (its called “shooting tethered”). Even though I have been aware of the concept, and tried it out briefly a year or so ago on my laptop, I didn’t really seen a benefit to. Continue reading
Adobe Goes into the Clouds - a few weeks back, I blogged about Adobe’s moving to a subscription only version of their products called Creative Cloud. At the time, I thought the idea was a bad one, but that a lot of people would probably sign on anyway. Adobe has thrown out figures that they have signed up in excess of 700K subscribers to their Creative Cloud platform, but they don’t say if that’s in line with what they were expecting. In light of some recent events, I’m thinking the number of subscribers have not been up to what they had planned. Continue reading
- PDI is an open source tool used to extract data from one source, transform it into other formats, and then load it into a destination. This process is called ETL, and its a big part of my day to day job. We’ll use PDI because its free open source software, and because it supports connecting to big data systems.
- MongoDB is an open source document NoSQL database system. Rather than storing data in rows and columns like a traditional database system, Mongo stores information in documents. This allows for quicker retrieval of information, but does require some getting used to how information needs to be formatted.
When creating ETL workflows, its useful to store the information in a database repository, rather than as individual files on your workstation. This allows multiple users to have access to the information (why recreate the wheel?), it allows you to pull it into your jobs quickly and easily, and you can back it up quickly and restore it if necessary.
Pentaho Data Integration (aka Kettle) is an open source ETL tool that has a repository feature, which allows you to store your transformations and job in local files, or in a central repository database. The file option is pretty easy to implement, so I won’t cover it here. Because of my work experience, I prefer to use a database server based repository. Unfortunately, the documentation for setting up a DB repository is sorely lacking (a common problem with a lot of open source projects). After some experimenting, I did figure out how to create a MySQL based repository, and how to connect to it from a Linux bases installation of PDI. Here is a walk through of the process: Continue reading
By default when MariaDB or MySQL is installed in Ubuntu, it is set to run automatically when the system starts up. For a server that makes sense, but if you are working on a development machine, like a laptop, you may not want it to always be running.
(BTW MariaDB is a fork of MySQL started by one of the original MySQL developers).
To disable the automatic start of MySQL on Ubuntu, open a terminal session and enter this command: ls -l /etc/rc?.d/S*mysql*