Posts in The Web Problem

Tuesday, August 19, 2008

Let’s make SXSWi 2009 suck less!

We Suck Less "VISI Hat Back"

Remember when you went to SXSWi last year, and you said “I love the parts where I meet cool people and eat free food and drink free booze and throw up, but I wish the presentations and panels weren’t so goddamn fluffy?” Me too. That’s why Alex Payne (aka “The Guy Who Has Actual Name Recognition”) and myself submitted the talk “Security for the Social Set.”

The idea is that we give some solid, useful information about the security problems social networking apps have to deal with, and how to deal with them. While we can’t get too focused on specific languages and frameworks, client-side defense with JavaScript will certainly be demonstrated, and I intend to show examples in PHP and probably a couple other platforms (coughRailscough). It will be hard to get into heavy detail within the alloted time, but it’s my hope that we can kickstart awareness and understanding of fundamental secure web app programming techniques.

Plus, I need a justification for dropping the coin for hotel and air fare on this boozefest, so please, vote for us.

Oh, and a few other meaty talks you should consider include:

Posted in General, JavaScript, InfoSec, The Web Problem, PHP by funkatron on 08/19 at 10:52 PM
(4) CommentsPost a comment

Wednesday, June 25, 2008

Give me your money, suckers

Gopher Robbery

I have these five promo codes from Dreamhost. They expire in three days. Apparently you get super-hot deals with them, like:

  • Four times the normal disk and bandwidth
  • $150 off a 5-year plan
  • $200 off a 10-year plan

I would not sign up for 5 years with any shared host. However, I do think DH is pretty decent for small-ish sites that can work with shared hosting. I’m well aware that they’ve had a couple embarrassing public flubs in the past couple years, but I generally think they’re good folks who try to do right by their customers. Some will certainly disagree, of course. 8)

I still maintain my old account there for low-cpu, high-bandwidth/storage stuff. They do quite well with PHP hosting, and can handle Ruby and Python setups with a little effort. They’ll let you do crazy stuff like compile your own PHP if you are fine with using FastCGI, and have a nice GUI to setup SVN repos. Not many shared hosts will do the same at a similar price point.

Anyway, here are the codes. They’re first come, first serve. Yes, I make money if you sign up with one of them. I will not be hurt if you don’t use them.

  • 998942362256
  • 916860787004
  • 591140700044
  • 047271781406
  • 160643597831

Dreamhost Signup Page

Posted in Development, The Web Problem, Python, PHP by funkatron on 06/25 at 05:26 PM
(0) CommentsPost a comment

Wednesday, June 18, 2008

PHP Job Opening at Purdue

Ross-Ade Stadium at Purdue University

I’ve been working at Purdue for over 6 years now, and I have to say that I’ve enjoyed it immensely. While the experience can vary from department to department, in general it’s a fairly low-pressure environment with a lot of opportunities to learn and develop new skills for motivated folks. Purdue offers great benefits, and the Greater Lafayette area is a pretty cool, decently progressive environment considering the size – US News & World Report recently picked West Lafayette one of the “Best Places to Retire” in the US.

A job just opened up in the Math department for a “Web Programmer/Analyst”. This position calls for strong PHP, PostgreSQL and MySQL skills; Linux and Unix fu (you should be able to admin the web stack); good communication skills (you’ll be working with a number of different people with varying tech backgrounds); and plenty of independent motivation.

I’m not involved in the hiring process at all, but I’m friends with those who are, so if you have specific questions, I can route them to the right person. Please email me directly – do not post questions in the comments.

Posted in Development, The Web Problem, PHP by funkatron on 06/18 at 01:05 PM
(1) CommentsPost a comment

Tuesday, June 03, 2008

Google App Engine from a PHP developer’s perspective

My cheatin' heart

When Google AppEngine was announced, a lot of sturm und drang went through the PHP community because it only supported Python. Of course, the same thing went through the Ruby community. The Perl community, not so much, because they’re used to getting ignored.

Anyway, I didn’t really give a crap, because Python is a cool language, and I’ve been interested in learning new languages and frameworks lately. PHP is cool and all, but having done it for 9 years, you kinda look for new challenges. Plus, getting hung up on one language is a sure way to be a boring dude who gets caught up in lame religious language wars.

I’ve actually toyed a little with Python here and there, but never dove in on a significant project. I’m someone who really likes whitespace and indenting rules and such, so those aspects of the language never bothered me. Otherwise, Python is a good fit for me philosophy-wise, because it’s totally the anti-Perl, lacking goofy magic characters and 1000 ways of doing the same thing. I can’t stand languages where “~^s” means “find all numbers that start with the letter ‘s’, add them, multiply them by 75, and then post the result to Twitter”. I’m not freakin’ R2D2 here.

So anyway, I started working on a little side project with Alex Payne on GAE. It’s been an enjoyable experience to work with a new language and framework, and there’s a lot to like about what GAE offers. It also has some rough edges, though, in some areas you’d be surprised by.

The good

The webapp framework

The built-in framework for handling requests is webapp, a nicely straightforward system for mapping requests to handlers.

class MainPage(webapp.RequestHandler):
    def get(self):
        self.response.out.write('Sup dogg?')

class FooPage(webapp.RequestHandler)
    def get(self):
        self.response.out.write('This is the foo page')

def main():
    wapp = webapp.WSGIApplication([
                                    ('/', MainPage),
                                    ('/foo', FooPage)
                                  ],
                                  debug=True)

    wsgiref.handlers.CGIHandler().run(wapp)

So we define a class to handle a specific URL request, and define methods for GET or POST or whatever HTTP commands inside those classes. Then in the main function we map URLs to those handler classes in the webapp.WSGIApplication constructor, and then start it up.

I like this because it’s simple and easy to follow. I could read the code before I’d written any myself and get what’s going on. There’s no magic happening here; no auth-mapped stuff or relationships established because I happened to name something the right way and put it in the right directory. Convention is good, and we have conventions here, but the conventions are clear and obvious.

Django templating

Django has a really nice templating system. GAE lets you use it outside of the whole Django stack. This is cool. For example:

# define some values to plug into the template
tpl_data = {
    'page_title':"This is an awesome page",
    'body':"Here's the page body, sucker"
    }

# get the template path
path = os.path.join(os.path.dirname(__file__), 'index.html')

#render and output the template
self.response.out.write(template.render(path, tpl_data))

Here’s a super simple index.html template:

<html>
    <head>
        <title>{{ page_title.striptags }}</title>
    </head>
    <body>
        {{ body }}
    </body>
</html>

Google User Auth

Writing a user system is a pain. Making people set up new accounts to use your site is yet another barrier to experiencing your amazing new Google sellout project site. You could presumably do these things if you want in GAE, but most of the time that seems kinda stupid. GAE offers you very easy integration with Google user accounts, so you can use that as your authentication system. Since everyone and their mom has a Gmail account, it’s a pretty good bet many or most of your users will be able to skip account setup.

Pricing

GAE is free for your first 500MB of storage and “around” 5 million pageviews. That’s plenty to host a smallish app or do initial testing on something you want to get big (because big == profit, right?). If you push past that, the pricing is in-line with similar services from Amazon. That means it’s totally reasonable to set up an app just for goofing around and learning.

Python

Python is a cool language: easy to pick up and quite powerful. It doesn’t fit some tasks as well as PHP IMHO, but it does a ton of stuff well, and its large community means there is lots of support and library development. Some stuff it does really well includes:

  • Network programming (example)
  • Parsing HTML
  • Email parsing and manipulation
  • OS X scripting
  • String manipulation
  • Unit testing
  • Other stuff I probably don’t know about

AppEngineLauncher

Recently the GAE team released the Google App Engine Launcher, an app for OS X that handles launching the development server and deploying your app to Google’s servers. You can of course script all this stuff yourself on the command line, but for folks less comfortable with that, this is a godsend. I actually did use the CLI before, but I don’t touch it now because the GAEL is just so fly. Single-click deployment is especially awesome.

Leveraging Google’s Infrastructure

They gots some big computers, rite? You’d certainly think so, and they probably have better uptime than your $5 shared hosting account. We hope.

The lame

Datastore query limitations & primitive text search

The Datastore API is GAE’s database system. It has some similarities to traditional relational DBs like MySQL and PostgreSQL, but it is fundamentally different. It stores data objects as entities instead of rows, and properties instead of columns. Objects are described by model classes, defined in Python:

class Page(db.Model):
    title = db.StringProperty(required=True)
    keywords = db.StringProperty()
    date_added = db.DateProperty(required=True)
    date_updated = db.DateProperty(required=True)
    body = db.StringProperty()

Models are to entities as classes are to objects, then (I think).

The object approach has some cool flexibility, like expando models that let you add arbitrary properties to entities (very unlike rows in a relational database).

You can do SQL-like queries against the Datastore with GQL. It’s a lot like SQL, but a lot more basic. In fact, this is a summary of the entire language:

SELECT * FROM <kind>
[WHERE <condition> [AND <condition> ...]]
[ORDER BY <property> [ASC | DESC] [, <property> [ASC | DESC] ...]]
[LIMIT [<offset>,]<count>]
[OFFSET <offset>]

<condition> := <property> {< | <= | > | >= | = | != } <value>
<condition> := <property> IN <list>
<condition> := ANCESTOR IS <entity or key>

This lets you do basic queries, but it’s missing a lot of powerful stuff present in SQL. Most notably there is no equivalent to the LIKE comparison operator. That means there is no way to do partial string matches!

Okay, so you can do kludgy prefix matching, and there is an as-of-yet undocumented whole word search using the SearchableModel class, but those are primitive compared to the string matching capabilities in MySQL. Personally I would’ve thought that the search company would have given GAE really awesome text search capabilities out of the box, but for now folks have to roll their own hacks.

I guess now I have an idea of why Gmail’s search can’t match partial strings.

No “beginner” docs

GAE’s docs are aimed at experienced developers who have already built web apps. Most of it is reference, and it’s a little light on examples. If you’ve not used Python before, here’s the entire introduction form the GAE docs:

For more information about Python, see the Python website and the Python documentation.

I think Python is easy to pick up, but GAE doesn’t even bother pointing out some good tutorials for beginners. Compare this to The Django Book or The Definitive Guide to Symfony, both of which aren’t really for beginners, but do a good job of helping them get up to speed.


I’m early in my development experience with GAE, so this is by no means comprehensive. Google has already demonstrated that the GAE feature set will be growing, having introduced memcache and image manipulation APIs just a week ago. I’m sure the documentation will be improved and supplemented by third parties in short order, and the issues with text search will almost certainly be addressed soon – it’s an obvious deficiency.

For now, I’m having a lot of fun working with GAE. It’s a solid platform with a lot of promise, and it offers a lot of learning opportunities for folks who want to explore web app dev with Python.

Posted in Development, The Web Problem, Python, PHP by funkatron on 06/03 at 09:11 PM
(14) CommentsPost a comment

Friday, December 28, 2007

Assumptions and Mac browser share

Logic Lane

If there’s one thing I’ve learned in my years as a grown-up, it’s this: Don’t make assumptions. All too often, things I’ve thought were no-brainers turned out to be questionable at best, like:

  • John Romero will make an awesome game on his own
  • Sony and Microsoft have made Nintendo irrelevant
  • Women will find my company tolerable

I often see techie types making assumptions based on their experience, like “program A is unusable” (because on his rather oddly set-up system, it crashed on install) or “no real programmers write in Ruby” (because he doesn’t know anyone personally who do) or “only a tiny fraction of Mac users browse with Firefox” (because it’s the default browser out of the box, and average users always use the default, right?).

It was this last assertion, and the accompanying statement “I don’t have to prove something so obvious” that inspired this article. I wouldn’t have been particularly surprised if Safari had a majority of the Mac browser market, but my personal experience had been that a fair number of non-techie Mac folks use Firefox. Beyond that, I wanted to get some real data, and not make assumptions. If you can’t provide some basis for your assertions, they’re not worth much. It’s a clear sign of a bad argument, and little gets me more worked up than weak logic.

The audience for a web site has a pretty significant impact on what browsers we see visitors using. Tech-oriented sites tend to have a higher percentage of users running alternate browsers like Firefox. You’ll get more Camino users on a Mac-oriented site, and Konqueror will show up more often on sites with content for Linux users. That means we have to at least take into account the content of the site when drawing conclusions from the numbers.

Here’s what I’ve found myself, and what others have shared with me, so far. Understandably, some folks shared their stats with me asked to remain anonymous, or that the site they come from not be revealed. I’ve described them as best I can. Also note that in some cases I received just info on Mac browsers, and others I had data for all browsers. I’ve calculated a ratio and “winner” at the end of each entry.

Is this data conclusive? Of course not. It’s a small sample of a handful of sites, and not particularly popular ones at that. But, it does seem to at least indicate that Firefox may be quite a bit more popular among Mac users than I – and certainly other people – thought.

If you’d like to contribute data, just drop me a line or leave a comment.

Funkatron.com

This site, obviously strongly tech-oriented, but also gets a decent amount of sport-oriented search engine traffic

Browser % of total
Firefox/Macintosh 19.31
Safari/Macintosh 13.38

Approx ratio: 1.5/1 Firefox

Sorority web site

Non-tech focused, active site for members of sorority

Browser % of Mac
Safari/Macintosh 48.44
Firefox/Macintosh 41.78

Approx ratio: 1.1/1 Safari

EricaBaker.com

Personal blog, significant tech/mac content

Browser % of Mac
Firefox/Macintosh 53.86
Safari/Macintosh 40.19

Approx ratio: 1.3/1 Firefox

Abandoned non-tech subculture site

Very low, mostly search engine traffic

Browser % of total
Firefox/Macintosh 10.75
Safari/Macintosh 10.75

Approx ratio: 1/1

Abandoned personal blog

Some tech/mac content

Browser % of total
Firefox/Macintosh 61.15
Safari/Macintosh 36.21

Approx ratio: 1.7/1 Firefox

New sports blog

Browser % of total
Safari/Macintosh 50.74
Firefox/Macintosh 45.92

Approx ratio: 1.1/1 Safari

Abandoned sports web site

Very low, mostly search engine traffic

Browser % of total
Safari/Macintosh 3.43
Firefox/Macintosh 1.75

Approx ratio: 2/1 Safari

Low-traffic, non-tech site

Browser % of Mac
Firefox/Macintosh 74.15
Safari/Macintosh 25.85

Approx ratio: 3/1 Firefox

Posted in General, The Web Problem, OS X by funkatron on 12/28 at 10:24 PM
(3) CommentsPost a comment
Page 1 of 15 pages  1 2 3 >  Last »