Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
×
Mozilla The Internet

Incorporating Machine Learning into Firefox 2.0? 806

blakeross asks: "I will be doing research this summer at Stanford with Professor Andrew Ng about how we can incorporate machine learning into Firefox. As we work to finish up Firefox 1.0, we're also seeking ideas that will make Firefox 2.0 blow every other browser out of the water. People who come up with the best 3-5 ideas that involve the use of machine learning will win Gmail accounts, and if we implement your idea you'll be acknowledged in both our paper and in Firefox credits. Your idea will also be appreciated by the millions of people who use Firefox. We'll also entertain Thunderbird proposals. See my weblog post for more details; I'll read all comments posted in response to this story or to my weblog."
This discussion has been archived. No new comments can be posted.

Incorporating Machine Learning into Firefox 2.0?

Comments Filter:
  • The top five ideas (Score:5, Interesting)

    by Anonymous Coward on Saturday July 10, 2004 @10:35PM (#9663953)
    Here are the best five ideas incorporating machine learning:

    1. Based on the user's browsing habits, automatically bookmark the most frequently visited sites, and automatically put them into *multiple* categories (not just one category) to make them easy to find.

    2. Create a full-text index in real-time of every page that has been browsed. When the user visits any web page, display a sidebar of "Related previously-viewed pages."

    3. A Google-News-like consolidation feature for the user's most-frequently visited news site, automatically highlighting stories of interest based on ones they've previously viewed.

    4. Allow user to select "Fewer images like this" or "More images like this" or "Less text like this" and "More text like this" and using Bayesian or other similar filters, automatically block or highlight content. For blocking advertisements, or highlighting certain key passages.

    5. Allow the user to browse their own hard drive, and categorize content automatically ("this is a document about lambs" ... "this is a picture of a sunflower") and let them group and search for items. Eg. "Pictures like this" or "Documents about cats."

    Please give my Gmail accounts to Gmail for the troops [isipp.com].
    • by bonch ( 38532 ) on Saturday July 10, 2004 @10:47PM (#9664025)
      ...a browser that doesn't have machine learning in it. Seriously, Firefox is slow enough for me. What on earth would you possibly need "machine learning" for in a web page browser? I'd immediately switch back to Opera (I don't use it simply because input forms lag during page-loading, some sort of multithreading issue).

      That kind of automatic crap is the same sort of stuff people would bitch about if Microsoft put it into IE. I mean, do you really want your browser actually learning anything about you? Imagine the havoc it could wreak, especially if trojans started fucking around with it.

      Just give me the leanest, meanest browser out there. That's all Firefox 2.0 needs to be. Not a damn learning machine. Sheesh.
      • by Anonymous Coward on Saturday July 10, 2004 @10:53PM (#9664068)
        Damn right!

        Looks like these guys are just looking for a place to dump their thesis after they finish.

        Thanks but no thanks.

        A browser doesn't really need machine learning as far as I'm concerned.

        If you want to waste a shitload of resources and bloat up some app add machine learning to emacs or something but not my browser!
      • by That's Unpossible! ( 722232 ) * on Saturday July 10, 2004 @11:10PM (#9664170)
        Here's an example:

        Bayesian filtering

        Thunderbird wouldn't be the same without it. Does it drag your system to a halt? Nope.

        I'd be awfully surprised if anything real CPU intensive would ever be installed into Firefox by default. Give these guys some credit.
        • by talaphid ( 702911 ) on Saturday July 10, 2004 @11:32PM (#9664284) Journal
          Speaking of Bayesian filtering, some form of clever-er guessing as to where my next bookmark in my ecclectic collection of bookmarks goes. Sample relatively unique keywords in pages as bookmarked, weight towards bookmark folder baskets, bingo.

          Avoid more sophisticated algorhythms that infer a sorting methodology the same as the developer, however. Maybe I have a Programming folder which has C in it, and so you'd infer that all characteristics of matches to Programming inherit to C, if that's the sort of sorter you are, and that fits with you, me, and program-think, so that's right? Right? Except perhaps I'm a university student who has a University folder, and I'm studying Java, whose extrinsic attribute prioritizes sorting it into that group... so you'd end up with a word weighting argument between superclass Programming, which is wrong, and Java, which is right.

          Let me be clear. This suggests nothing at all about helping the user organize their bookmarks - everyone has their own system (although perhaps a Bayesian category guesser would be a separate fun feature). This suggestion is merely better guessing of first suggested folder when I CTRL-D.
          • That would be kind of useful, if I wasn't currently drifting away from browser-handled bookmarks and instead using a customized start page using CSS-generated menus [meyerweb.com], in which you can fit anything from links to the Google search <form>.
            hmm, that would be a nice feature - a start page generated from your bookmark folders, utilizing meta-bookmarks (which are in fact HTML snippets). Add customizable CSS and a name like about:start and I'd be sold.

            And if you want to cram a learning algorithm into that,
        • by SpootFinallyRegister ( 787720 ) on Sunday July 11, 2004 @01:00AM (#9664642)
          Yes, I do know what machine learning is, and I have to agree with the original poster. Not because there aren't potential uses for machine learning in a web browser, but because this is a fundamentally wrong way to develop software.

          Useful solutions to problems arise by developing solutions to problems, not by developing solutions and trying to shoehorn them into existing solutions. Create what you need -- dont create need to fit your ideas.

          Carving a square peg and wandering around a city to see where it fits makes much less sense than finding teh hole you need filled, and carving the peg to match.
      • Answer: (Score:5, Insightful)

        by bersl2 ( 689221 ) on Saturday July 10, 2004 @11:30PM (#9664271) Journal
        Extension. It's why that framework exists.
      • by torokun ( 148213 ) on Sunday July 11, 2004 @01:55AM (#9664846) Homepage
        Here's my idea: Use machine learning to figure out the most-often-used code paths, and thereby allow users to optimize their browsers by removing or unloading from memory the least-needed functions, or rearranging the code to allow fastest access to those functions... Maybe you could still leave stubs in the UI so that code could be dynamically loaded if someone eventually decided to use something they rarely used. A self-optimizing browser that sheds the code you don't give a crap about. Now that would rock.
        • It's already here. It's called virtual memory. Rarely used sections of memory get paged to disk in favour of more commonly used pages.
      • ...a browser that doesn't have machine learning in it. Seriously, Firefox is slow enough for me. What on earth would you possibly need "machine learning" for in a web page browser? I'd immediately switch back to Opera (I don't use it simply because input forms lag during page-loading, some sort of multithreading issue).

        Ah one of the "I Don't want any feature because it will slow down this product person even though I have no idea on how it is implemented". Don't complain about it until you see it in acti

      • I hate to do a 'me too' post, but short of a /. poll on the subject, how else are we to voice how strongly we feel about this?

        I like my applications to be as discrete as possible. Sure, make them interoperable, but don't turn a sharp little application like Firefox into a clumsy swiss army knife.

        The obvious compromise would be to make these advanced features an extension that can be plugged in, but, although I'm not familiar with the Firefox source, I am an experienced programmer and I'd expect somet
    • by Anonymous Coward
      1. Based on the user's browsing habits, automatically bookmark the most frequently visited sites, and automatically put them into *multiple* categories (not just one category) to make them easy to find.

      No, don't do that. You think I want my favorite porn sites automatically bookmarked without me realizing it so my wife can see it and bitch me out?

      OR how about at work when some site I look at while goofing off ends up in my favorites? Ya, I really need some sports wesbites showing up in my favorities at w
    • by omaha ( 41554 ) on Saturday July 10, 2004 @10:57PM (#9664087) Homepage
      2. Create a full-text index in real-time of every page that has been browsed. When the user visits any web page, display a sidebar of "Related previously-viewed pages."

      see http://pychelsea.sourceforge.net/ [sourceforge.net]

    • 2. Create a full-text index in real-time of every page that has been browsed. When the user visits any web page, display a sidebar of "Related previously-viewed pages."


      Yes, yes, yes! I've been interested in something like this for a long time.

      To 'tune' the idea to more of what I've had in mind, the full-text index would have tuning capabilities. The user should be able to cache all data in given categories for later reading and retrieval, as well as be able to 'highlight' specific data as they go along.
    • All good ideas and I am sure that many people will come with some other good ones but please, the most important is to give the option to easily Turn Them Off! For example I would like to be able to turn on and of JavaScript from a button on the browser. The same way it would be nice to be able to customize a toolbar where you had an on off buttons for those features that I maybe don't want to use all the time.
    • by Stray7Xi ( 698337 ) on Sunday July 11, 2004 @03:54AM (#9665230)
      Machine learning? how bout user learning... there's enough shortcuts and quirks to justify "tip of the day"

      but on machine learning...
      6. Prioritize the order in which images are downloaded on screen, based on whats most useful to the content of web page.
      7. Recognize which links on a page are most likely to be visited and place it on a menu available on status bar(similar to styles)
      8 Produce a citation for the current page (try to find author and other important details)
      9. Based on font resizes the user has done in the past, make sure the fonts displayed on page are large enough for user to read.
      10. Recognize when a user does a search at a site often enough and offer to add it to their search bar.
      11. Recognize which webpages the user is more likely to revisit, to sort the history by.
  • by chendo ( 678767 ) on Saturday July 10, 2004 @10:37PM (#9663964)
    .... considering how much Google gave out to drop the prices on eBay.

    I suggest better prizes. Y'know, like a girlfriend? I'm sure lots of us Slashdotters would like to have one over a Gmail account :p
  • ideas (Score:5, Interesting)

    by pangel83 ( 598985 ) * on Saturday July 10, 2004 @10:37PM (#9663971) Homepage
    - The pop-up management in modern browsers who provide this feature although more efficient than in the past is still not perfect. Adapt to what pop-ups a person normally uses

    - Content highlighting (especially in news sites). Learn what types of news articles / subjects a user is interested in, and highlight titles in news pages that suit the user.

    - Accelerator for narrowband connections. Predict which pages the user is more likely to visit next, and start loading them as the user still reads the previous page.

    - Recognise efficiently scam sites? Protect users from fraudsters?

    PS: Not machine learning, but the sole requirement by me for a browser (dunno if its done in firefox now as hvent used it for a long time): Open new tab as a default rather than a new window, or at least provide the option.

    • Re:ideas (Score:3, Informative)

      by __aawavt7683 ( 72055 )
      Requirement met: www.google.com, type in tabbrowser extension. First link is a mozilla plugin that has that as an option.

      Actually, it was the third link. http://white.sakura.ne.jp/~piro/xul/_tabextensions .html.en#download

      Click the third link down on that page at that area (Download Tabbrowser Extensions [tabextensions_en.xpi]" and there ya go.

      -DrkShadow
    • Re:ideas (Score:2, Informative)

      by Dizzle ( 781717 )
      "PS: Not machine learning, but the sole requirement by me for a browser (dunno if its done in firefox now as hvent used it for a long time): Open new tab as a default rather than a new window, or at least provide the option."

      3 words: Tab Browser Extensions. It's all there.
    • Re:ideas (Score:5, Insightful)

      by TheFlyingGoat ( 161967 ) on Saturday July 10, 2004 @11:08PM (#9664158) Homepage Journal
      Accelerator for narrowband connections. Predict which pages the user is more likely to visit next, and start loading them as the user still reads the previous page.

      This is the only suggestion so far that really seems worth making the browser larger (and hence, slower).
      • Re:ideas (Score:5, Informative)

        by hab136 ( 30884 ) on Saturday July 10, 2004 @11:45PM (#9664338) Journal
        Accelerator for narrowband connections. Predict which pages the user is more likely to visit next, and start loading them as the user still reads the previous page.

        This is the only suggestion so far that really seems worth making the browser larger (and hence, slower).

        Link Prefetching [mozilla.org] is already in Mozilla/Firefox.

      • by DonGar ( 204570 ) on Sunday July 11, 2004 @02:07AM (#9664886) Homepage
        I frequently adjust different aspects of my browser for different sites. Adjusting the window size/position, bump up font size by 10%, allow/block images, whatever.

        I'd like a system that remembers those adjustments, and not only reuses them when I return to the same site, but applies them again where appropriate. 'Where appropriate' is where machine learning comes in.

    • Re:ideas (Score:3, Informative)

      by Biogenesis ( 670772 )
      Here's a partial solution to the "open in new tab" problem:

      #!/bin/sh
      export MOZILLA_FIVE_HOME="/usr/lib/mozilla-firefox"

      ur l="$1"
      if [ "x$url" = "x" ]; then
      url="about:blank"
      fi

      if $MOZILLA_FIVE_HOME/mozilla-xremote-client -a firefox openURL\("$url",new-tab\); then
      exit 0
      fi
      exec $MOZILLA_FIVE_HOME/firefox "$url"
      </TT>

      Shamelessly ripped from Here [osnn.net]

      • Re:ideas (Score:3, Interesting)

        by mbourgon ( 186257 )
        How about an option to open javascripted newwindows in a new tab? Lots of sights have slideshows that want to spawn a window - if you middle-click on them, the tab is empty.

        Hell, an easy way to save movies just shown - the page info doesn't seem to save "media" or "embed" objects.
    • Don't ask me if I want to remember a username/password combo until AFTER the login has been successful.
  • auto-focus (Score:2, Interesting)

    by Anonymous Coward
    find a way to automatically aim focus at the box that the user seems to use most on a given page. this is my major annoyance with some form sites that insist on giving focus to something that i would never fill in first.
    • Or at least the frame the user uses most often. It's also annoying to hit page-down, and not get the result you want (eg. you're focused on a frame that can't scroll, for instance)
  • My Requests (Score:2, Insightful)

    by green pizza ( 159161 )
    1) Make it faster
    2) Please keep GTK+ 1.x support
    • 1) Make it faster

      Hope that includes making it smaller as well since it takes an awful long time just to load on an older laptop.

  • Smart Tabs (Score:5, Funny)

    by c0dedude ( 587568 ) on Saturday July 10, 2004 @10:41PM (#9663992)
    Make it so you can open all links on a page in new tabs, and the browser will sort them by content.

    Also, it would be awesome if using the internet were more like playing Fallout. That was a great game.
  • idea (Score:5, Interesting)

    by undertow3886 ( 605537 ) <geoff@@@amsa...info> on Saturday July 10, 2004 @10:42PM (#9663993)
    Make it so when the user hits the Page Down key, a horizontal line appears for a few seconds where the old bottom of the page was, then fades away. So when you're reading long sections of text and hit Page Down, your eye can quickly scan to where you left off.
    • Re:idea (Score:2, Insightful)

      by laserbeak ( 794029 )
      same with middle click scroll, have a transparent gray line where the top of the page was when middle click scroll was clicked!
    • GOOD IDEA!!! (Score:5, Insightful)

      by green pizza ( 159161 ) on Saturday July 10, 2004 @10:55PM (#9664077) Homepage
      Whoa! Good call! That's an awesome, basic feature that could easily be added to FireFox without bogging it down.

      (So many of the other suggestions so far would make FireFox slow to a crawl. Lets keep it lean and mean, please!)
    • Re:idea (Score:2, Insightful)

      by da_foz ( 751028 )
      Nice idea. But you would only want it to happen when there is a large section of text that goes off the bottom the screen. Part of the learning for this feature would have to be to recognize when this is the case. ie you would not want to to happen is you are scrolling down to look at a picture or if you are already at the bottom of the text.
    • Damn that's a great idea! I have developed the habit of using my mouse to highlight random text on the page I'm reading before hitting [PAGE DOWN] for this very reason. This would eliminate my need to reach for the mouse everytime I want to do this.
    • Re:idea (Score:2, Insightful)

      by mforbes ( 575538 )
      I -love- this idea! I've suffered from the same problem for years, especially in long articles like those to which Slashdot frequently links, and never bothered thinking about how to fix it-- but this idea is a WINNER!

      Yeah yeah, I know, I'm not really adding anything to the value of this thread-- but i wanted undertow3886 to know how much I like the way he (she?) thinks.

    • Re:idea (Score:3, Interesting)

      by moonbender ( 547943 )
      Weird. I don't find this useful at all; I never had any problem with finding the line I stopped - it's always at the same basic spot, after all. Maybe it's something I picked up reading a couple thousand pages worth of e-books on computers... :)

      In any event, sounds like im in a harsh minority here, everybody else seems to be thrilled by the idea. And I guess I could always turn it off, beside the fact that it doesn't sound very intrusive at all. Nice.
  • lets see here (Score:3, Insightful)

    by deus_X_machina ( 413485 ) on Saturday July 10, 2004 @10:45PM (#9664004)
    "we're also seeking ideas that will make Firefox 2.0 blow every other browser out of the water."

    The competition: Internet Explorer, Netscape, Lynx, and Safari.

    I'd say it's already pretty much covered...
    (love my FireFox)
    • Re:lets see here (Score:5, Insightful)

      by green pizza ( 159161 ) on Saturday July 10, 2004 @10:51PM (#9664059) Homepage
      >> we're also seeking ideas that will make Firefox
      >> 2.0 blow every other browser out of the water.

      >The competition: Internet Explorer, Netscape, Lynx,
      >and Safari.
      >I'd say it's already pretty much covered...

      I would have to agree that FireFox is pretty much the best browser today. But the performance on older hardware isn't much better than that of a new build of Mozilla. Also, the move to GTK2 and feature-creep are slowing it down futher.

      Safari on the other hand is very basic and very fast. Try it on a 300 MHz G3 to see what I mean. Apple did a great job of making a basic browser out of Konq/KHTML. I would **LOVE** to see a similar app for X11 (so I can run it on Linux, Solaris, and IRIX). Konq Embedded is close, but it's lacking some important features and isn't as fast as it could be.

      Long live FireFox--but keep it slim!
  • How about... cookie management? I want my amazon/slashdot/gmail cookies, but um, why are there porno and advertising cookies on my computer? (wait, no. Don't answer that.)

    The plan.... a bayesian filter on cookies! Or predict my browsing habits, and load the page before I click on it... or... FIND ME PORN THAT I LIKE!

    (hey, this post is a stream of consciousness. I'm brainstorming. You're gunna get porn. deal with it.)

  • these may be implemented already somewhere but here goes:

    1) say the user goes to the onion or gamespot a lot (that's me) and if I hit the refresh button right away the advertisement page goes away. if firefox noticed I did that all the time, it might ask me if I want to refresh right away always on that domain to save me the hassle.

    1.5) apply the above idea to watch for user behavior on various domains / pages. automate repetitive tasks

    2) catalogue what data users enter in what form controls. When a

  • FYI (Score:5, Informative)

    by grammar fascist ( 239789 ) on Saturday July 10, 2004 @10:46PM (#9664022) Homepage
    (Undisclaimer: I do machine learning research at BYU.)

    Machine learning, in general, is getting computers to generalize based on data instances. The two main flavors are classification (inferring classifications of data instances based on previous instances) and regression (inferring a function based on input/output pairs).

    A lot of people incorporate artificial intelligence into the category "machine learning," though it's not strictly correct. Machine learning is more a branch of AI than anything. One way to keep them straight is to think AI = deduction, ML = induction. (That's vastly simplifying, but it helps to classify them roughly.)

    I wonder which way the author leans? Could he possibly post to clarify his meaning? :)

    You can do an awful lot with machine learning that you can't do with conventional techniques. You can often get great results for otherwise NP-hard problems. Slashdot had a story a while back about using machine learning to do mesh compression [slashdot.org], in which their algorithm comes up with a close approximation to the real answer to an NP-hard problem in polynomial time.

    I'm currently using it to interpolate 2D images, and kicking bicubic B-spline interpolation all to heck. (Paper pending...) The machine learning algorithm infers shapes from the pixels, and keeps edges sharp.

    If I come up with an idea, I'll post it later. In the meantime: isn't Firefox supposed to be lean and mean? :)
  • by MSBob ( 307239 ) on Saturday July 10, 2004 @10:48PM (#9664040)
    I've been waiting for searchable bookmarks for about a decade now and it is yet to appear in any web browser. Bookmarks as implemented in today's browsers are useless. They are unmanageable beyond twenty or so and the interfaces to keep them "organized" in "folders" are clumsy at best.

    There. Your most important feature that browsers never had. Searchable bookmarks. Doesn't get much simpler than that. Am I the only one who thinks it's something every browser should have had long time aog?

    • Searchable bookmarks?

      Doesn't that just mean use google? ::)
    • by jeffehobbs ( 419930 ) on Saturday July 10, 2004 @11:27PM (#9664250) Homepage

      I've been waiting for searchable bookmarks for about a decade now and it is yet to appear in any web browser.

      Your decade is at a close! As of version 5, available today, Omniweb [omnigroup.com] has both searchable bookmarks and history, Launchbar [obdev.at] (also available now) can search across all browser bookmarks simultaneouslt, and Safari 2.0 [apple.com] will have this kind of functionality as well next year in Mac OS X "Tiger".

      ~jeff
    • by Coryoth ( 254751 ) on Saturday July 10, 2004 @11:59PM (#9664395) Homepage Journal
      The crux of the problme here is that (looking at your replies to other posters) you want to search the content of pages in your bookmarks (or even just in your history).

      For someone that's keen (I may even try myself if I find some time) couldn't a close approximation of this be done VERY simply by just sending a search request to google restricting to sites listed in your history? All you need to do is parse out the unique sites from your history or bookmarks, and just pass those in to the google search. You could practically just write a bookmarklet to do it right now...

      Jedidiah.
    • It isn't freeware, and if someone wants to clone it as a freeware, Firefox-aware app that'd be great, but it does just what you are asking for.

      It narrows the bookmarks as you type, based on title, URL, and keyword fields.

      http://www.kaylon.com/power.html

      If Firefox had this built in via a search bar or some such it'd be awesome.
  • by eyefish ( 324893 ) on Saturday July 10, 2004 @10:49PM (#9664047)
    How about this: How about browsing the filesystem using tabs?

    So for example, in one Firefox window you see the contents of your hard drive (or network folders) pretty much the same way as the Windows Explorer or Gnome/KDE/MacOSX show it to you today BUT if you click the middle mouse button on a directory (or select "open in new tab") you end up with the new directory being open in a new tab.

    Think about it, how many windows do you usually have open browsing your filesystem? with this thing you have ALL those windows in the same window organized by tabs, PLUS you also have all you websites as well on tabs right along the filesystem tabs!

    And here's another kicker: You can bookmark a group of filesystem browser tabs and later go back to them. You can even drag the group of bookmark tabs to the desktop so that when you double-click on it Firefox opens up all of them at once.

    This should all be done with host filesystem integration so that you can drag-and-drop files between the firebox filesystem view and the normal host OS desktop.

    • by Sparr0 ( 451780 ) <sparr0@gmail.com> on Saturday July 10, 2004 @11:34PM (#9664293) Homepage Journal
      uhm, every mozilla-based browser, as well as konqueror, can already do almost exactly what you have described. i just checked, i can do all of it except drag the bookmark to the desktop and filesystem integration in mozilla and firefox, and it only takes about 30 seconds to make a shortcut to the bookmark group on the desktop manually. in konqueror i can do all of it except tabs (it opens a new window on middle click, which is as good as tabs if you give it its own desktop) and bookmark groups, although i understand theres a patch underway to enable bookmark groups. if the features of moz and konq were combined you would have EXACTLY what you described. so, nothing new here, just a wish to put them together.
      • Konqueror can be made to open a new tab instead of a new window.

        From in Konqueror, Settings -> Configure Konqueror -> Web Behavior -> Open Links in a new tab instead of a new window.

        It says links, but the setting applies to filesystem browsing too.
  • by Manip ( 656104 ) on Saturday July 10, 2004 @10:53PM (#9664069)
    Does anyone else get the feeling that they are adding this just for the sake of it or so they can say they have it? I mean when you have the technology before any useful uses for it then clearly there is something wrong.

    I think that creating a good browser though gimmicks is a poor long term strategy and seriously doubt this route will turn up anything useful. Ideas should be so simple and obvious and inspire us to say 'who dont we have that already?!' not something we search for!
  • Intelligent mail sorting. This isn't my idea, but unfortunately I can't remember where I heard it (probably someone else's slashdot post). The same methods that are used today in bayesian spam filtering could be used to sort my mail into folders for me. This might prove a little more difficult than spam filtering because you have n categories instead of just 2 (spam and not spam), but it would certainly be useful.
    • Intelligent mail sorting. This isn't my idea, but unfortunately I can't remember where I heard it (probably someone else's slashdot post). The same methods that are used today in bayesian spam filtering could be used to sort my mail into folders for me. This might prove a little more difficult than spam filtering because you have n categories instead of just 2 (spam and not spam), but it would certainly be useful.

      Yes, folder categorization is MUCH harder than spam classification, and the implication is y

  • by jaaron ( 551839 ) on Saturday July 10, 2004 @10:58PM (#9664095) Homepage
    Isn't this going about things backwards a little?

    To me this sounds like a clear case of "technology X is really cool. Let's find some reason to include it in product Y." Which often means that product Y becomes much more complicated than it needs to be.

    How about first looking for a list of browser "needs" so to speak. What would make the best browser? What current deficiencies to browsers have? And so on. Then, if you really want to, try to figure out if any of these problems could be solved with machine learning.

    Don't just inject a technology into a product because it's cool. Make sure there's a real need for it.
  • Bookmark Clustering (Score:4, Interesting)

    by women ( 768472 ) on Saturday July 10, 2004 @10:58PM (#9664098)
    I don't know if this has been posted, but I'll give it a shot...

    I've accumulated well over a thousand bookmarks and have been much too lazy to organize them into folders. If you could automatically cluster bookmarks (http://vivisimo.com/ does this with web results) I would be eternally grateful.

    One more suggestion is to learn usage patterns in a particular website. For example, when I go to http://www.nytimes.com, I generally click on the opinions sections. If the browser could anticipate that I typically go to the opinion section, it could start to preload it before I click on it.

    I realize the later suggestion is much easier to implement than the former, but the clustering would be very useful for lazy surfers like me.
  • by Smidge204 ( 605297 ) on Saturday July 10, 2004 @10:59PM (#9664099) Journal
    Make it an extension only!

    Seriously, it would be a really neat feature if some of the suggestions posted here were realized... but this whole idea screams of bloat bloat bloat. What makes FireFox so appealing for some (including me) is it's compactness and lack of bells and whistles. The FireFox project FAQ echos these sentiments: It's small, fast, simplified, nothing other than what you need. "Just a browser" [texturizer.net]

    Don't let feature creep ruin it!
    =Smidge=
  • MOST IMPORTANT... (Score:5, Insightful)

    by Eric_Cartman_South_P ( 594330 ) on Saturday July 10, 2004 @11:02PM (#9664111)
    ...include an option to TURN IT OFF.

    I hate it when anything software tries to "predict". I don't want it. Please make sure it has an OFF button. Seriously. Thank you.

  • Suggest synonyms, correct spelling, provide "improved" search queries based on previous related searches which will yield better results. What i mean is that if i'm searching for something on google and my first two searches turn up nothing wouldn't it be nice if it could help me search better

    Already got gmail account!
  • I have a hammer. Does anyone have suggestions on how I can integrate nails into Firefox 2.0?
  • by Dark Lord Seth ( 584963 ) on Saturday July 10, 2004 @11:04PM (#9664132) Journal

    Do NOT bloat the browser.

    Want to add crap? PLUGINS!

  • by Mmm coffee ( 679570 ) on Saturday July 10, 2004 @11:04PM (#9664136) Journal
    ...Is to make it very easy to turn whatever machine learning features incorperated into 2.0 off totally, with minimum fuss and searching.

    It is my firm belief that then #1 rule of UI design is that the program should should look and act consistant. And the number two rule is that the program should never assume anything, or perform any action without the user explicitly telling it to (barring sane default behaviors that will fit > 85% of the users). Every ML feature I have ever seen breaks #1 and #2 with reckless abandon by changing something to make it more 'friendly', which in turns makes it less friendly because I don't know _exactly_ what to expect from my program.

    Looking at the comments on that weblog, I can not find a single idea that does not either violate my top two rules, or would otherwise annoy me to no end. If they have to add that to Firefox then please, let me turn that crap off in three mouse clicks or less.
  • by dduardo ( 592868 ) on Saturday July 10, 2004 @11:06PM (#9664147)
    The firefox download manager should scan downloads for malicious spyware, stop the bad download(s) and warn the user of the danger posed by the file(s).
  • by zwalters ( 532390 ) on Saturday July 10, 2004 @11:08PM (#9664156)
    A moderately annoying, but extremely common procedure when I'm browsing is to have a specific destination in mind, say Baseball Primer http://www.baseballthinkfactory.org/files/primer/

    Now, because this has a lot of discussions, when I start typing basebal... I get a lot of urls in the autocompletion field like http://www.baseballthinkfactory.org/files/primer/o racle/

    or even unrelated baseball sites. So it's not uncommon for me to have to press downarrow several times. A very useful application of machine learning would be to order the autocompletion possibilities so that my average number of downarrow presses is minimized.

  • by Animats ( 122034 ) on Saturday July 10, 2004 @11:16PM (#9664203) Homepage
    What's wrong with this picture?

    Adding what passes for "machine learning" to a user interface usually results in something that does the right thing some of the time, the wrong thing some of the time, and you can't figure out why.

    Bayesian spam filtering is becoming like that. At first it worked, but it's breaking down under the rising percentage spam.

  • by killjoe ( 766577 ) on Saturday July 10, 2004 @11:17PM (#9664205)
    Have it learn where I am saving which files and offer up that directory as default. If I am saving all pictures into one directory and all movies into another it should know that.

    I want virtual folders in my mail. These are "live" queries like "all mail today" or "all mail marked urgent". As I mark metadate on the email they will show up in the proper virtual folder.

    Full text search of all email.

    Choice of multiple home pages. It learns when I want my home pageX and homepage Y.

    Roaming bookmarks!!!. While I am at it roaming everything including profiles and preferences. The ability to carry my email filters from location to location would be awsome.

    A network install where the administrator can set global prefs and install global plugins. I also want the option to override the users preferences and lock them out of certain setting.

    It should learn to adjust my font size (and other settings?) based on site. If a web site always puts tiny print then I want the fonts larger only for that site. Perhaps have it learn "ugly" sites and put my default styles instead.

    Auto proxy. I want to feed a list of proxy servers and have it switch randomly (even from one site to another). Think of this as super privacy.

    Ability to arbitrarily morph the the incoming text stream using regexp or javascript. This would allow me to roll my own weird crap.

    Make XUL 50 times better. Make it so it's trivial to use XUL to make database front ends. Give me a great GUI builder for it.

    I have lots more ideas but that's enough for now.
  • by slobber ( 685169 ) on Saturday July 10, 2004 @11:24PM (#9664236)
    Currently, if I start typing URL in the address bar, it matches URLs alphabetically. This gets very annoying at times, especially if you accidentally type giigle.com instead of google.com and then it keeps on matching giggle.com for weeks when I type "g".

    This problem can be fixed by using frequency count with some time decay. For example, if I went to google.com 100 times within last week and once to giggle.com, then match to google.com on "g". If, however, I went to giigle.com 5 times recently, then match to giigle.com

    While one might argue that this makes the algorithm unpredictable from user's standpoint, in my experience people keep on typing until they see the correct match. So, this way they'll see the right match sooner on average.
  • by AllNicksWereTaken ( 741964 ) on Saturday July 10, 2004 @11:29PM (#9664263)
    (Clippo, from Office, featured in Firefox...) Clippo: It looks like you're browsing pornography. You also appear to be typing with your left hand. Would you like to enable the spellchecker?
  • by MonkeyBoyo ( 630427 ) on Saturday July 10, 2004 @11:32PM (#9664285)
    Often masses of information are broken into multi page presentations.

    Somewhere on the page you have buttons named things like Next, Previous, or Page: 1 2 3 4 5 6.

    There may be good design rules for positioning these elements but often they are not followed.

    I've found many instances where I have to scroll up or down just to find the Next button so that I can click it.

    It should be possible to learn for a given site (or sub-tree of a site) what the Next and Previous buttons are just from user behavior and the nearly identical layout of say page 2 to page 3. I think this could be done without parsing any of the html or gifs associated with the buttons.

    If Firefox could learn and extract multi-page navigation then these functions could be bound to buttons up on the menu bar, or assigned to keys, and the whole problem of scrolling to find a Next would go away.
  • by shaitand ( 626655 ) * on Saturday July 10, 2004 @11:35PM (#9664295) Journal
    There are three types of sites in the world:

    Those that use flash for ads
    Those that use flash for content
    Those that stay the hell away from flash

    Rightnow, Firefox doesn't have any way to tell the difference between 1 and 2. But I do, I can clearly see if it's an ad or not. On every flash ad give me the option to tell the browser it's good flash or bad flash and intelligently learn what sites ("sites" also being defined by study of the urls, if I say www.bob.com/~jimbo/whatever.htm and www.john.com/~jimbo/howie.htm and www.curly.com/~jimbo/marthastewart.html are bad it should figure out there is a commonality in the ~jimbo part and apply my preference) have bad flash and block flash content on those sites, instead presenting me with a button to load to allow that content to load.

    It should use a number of pieces of information, the url of the page, the url of the flash animation, the size of the animation, the name of the animation, the server the page is being served off of, etc.
  • by doctor_no ( 214917 ) on Saturday July 10, 2004 @11:41PM (#9664317)
    Smater Front Page: Making use of the first thing that the user sees when starting up.

    How about creating an interface for the default page for Firebird. Instead of pointing to the Mozilla.org homepage creating a default Front page designed to evolve to the habits of the user. Whatever way you want to utilize machine learning, you will need a centralized location to acess the results, why not use "Home". That being, creating a simple interface (XUL, not html) that points the user to their most visted bookmarks, or a catagorised and searchable list of their bookmarks(or internet), or updating the user if their most visted sites are updated, aggregating information from sites from their own browsing habits in a single interface when the browser starts up.

    Also, if the user uses Thunderbird or Sunbird updating the user of new E-mails and new appointments on the front page. A front page that is customizable to the needs of the user, and avoiding the clutter and ads of commerical sites, and that is local on the users computer and not centralized on a website. And most importantly makes the individual users own data most intuitively accesible to themselves, and evolves to fit the individual user.
  • by mr_gerbik ( 122036 ) on Sunday July 11, 2004 @01:02AM (#9664654)
    According to ebay, that prize is worth about 95 cents!
  • by Dwonis ( 52652 ) * on Sunday July 11, 2004 @01:38AM (#9664787)
    (But give it a better name).

    Firefox needs an option to make the browser detect, and work around, user-interface abnormalities in poorly-designed websites.

    It's fairly well-established that the best user interfaces are the ones where there is no discrepancy between what the user thinks is going to happen, and what actually happens.

    When a user single-clicks a link, the link should open in the current window. Always. Any other behaviour (such as opening a new window) causes the user to be frustrated (or at least slowed down).

    Similarly, when the user middle-clicks a link (or shift-clicks or whatever), the link should always open in a new window/tab. No oddities like "javascript:gotosite()" or "http://path/to/exact-same-page.html#" should happen.

    Unfortunately, there are a lot of misguided website authors that think they're being helpful by doing non-standard things in an attempt to anticipate users' needs. This means that you'd need some type of machine-learning in order to work around these problems at the browser level.

    I imagine this would be done in a way similar to how SpamAssassin works.

  • Truthalizer (Score:5, Interesting)

    by erixtark ( 413840 ) on Sunday July 11, 2004 @03:10AM (#9665123)
    I would want a credibility rating on web pages.

    There is a lot of information on the web but almost no way to verify the data. I would like a way for people to report the credibility of the information contained on a web page.

    This is especially important with news reporting and double-extra-especially important in times of war.

    It is also all too simple for politicians, journalists and other people of power to repeat the same old lies over and over.

    The memory of media is short. A Truthalizer would help make it a bit longer.
  • Suggestions (Score:4, Interesting)

    by Issue9mm ( 97360 ) on Sunday July 11, 2004 @04:30AM (#9665290)
    Yes, I know that there are plugins for this one, but keeping bookmarks in sync on multiple machines via server storage. Since I got my laptop, it's not a big issue, but I hate having to go back to my other machine cause that's where the bookmark is.

    Enabling persistent storage of passwords. I honestly don't know how much or little of a security hole this would be, but I am constantly using the "remember my password" feature in Moz/FF, and it KILLS me when I have to reinstall and start adding them all again. Have it store that file to a spot on the hard drive (or better, server sync if that option is turned on), and allow me to keep that data even if I have to uninstall/reinstall the application.

    -9mm-
  • Saving images (Score:3, Interesting)

    by torklugnutz ( 212328 ) on Sunday July 11, 2004 @04:56AM (#9665359) Homepage
    I grab images off websites all the time and save them into a bunch of different folders. I would like if the browser paid attention to where I saved certain images from certain sites and then automatically jump to the folder I last used FOR THAT SITE. It would save me a few seconds a day, at least.
  • by nyri ( 132206 ) on Sunday July 11, 2004 @05:08AM (#9665385)
    People regard their privacy highly and are reluctant to provide their personal information to websites. Website which need personal details from its users have to convince a new user that it won't misuse the data that it has. This is usually made with a statement, known as Privacy Policy.

    Privacy Policies have problems. First, they are usually written in a legal language incomprehensible to a lay person. Second, people have no way of knowing that website in fact follows the policy it has.

    One way of assuring people might be machine readable privacy policies. P3P [w3.org] (Platform for Privacy Preferences) is W3C framework for such privacy protocol. It allows user to store his personal data to a P3P agent, which will then follow the user given rules to share private information.

    This agent should be implemented to firefox and it could use a machine learning to automate further the interactions with websites. The agent could for example learn, that if website's privacy policy promises to use user's e-mail address only for initial consistency check and to send a forgetten password if user explicitly asks for it, the agent can give it to the website without prompting the user.

    Of course, this won't solve the problem of malicious websites which don't follow their privacy policies, but is a step into right direction where privacy policies are certified and their enforcing is auditted.
  • Give me VIM (Score:3, Interesting)

    by Kent Recal ( 714863 ) on Sunday July 11, 2004 @09:36AM (#9665884)
    Machine learning?
    Give me a break.

    What I'd like to see if I could finally use vim for these damn textareas (or any editor of choice for that matter).
  • by tempshill ( 413165 ) on Sunday July 11, 2004 @10:25AM (#9666057)
    The following is the best e-mail idea of the last two years, and should be a feature of every e-mail program.

    As a person who writes a lot of e-mail, or as a manger, one major organizational problem is simply not having your messages replied to. It currently takes a ton of manual effort to decide which of your e-mails need a followup by you. So much effort that nobody does it. Some questions fall through the cracks when you write a hundred e-mails a day. Closing up these cracks would measurably, demonstrably improve the effectiveness of managers, project coordinators, and any other heavy communicators, since e-mail has become the most important and most used mode of communication for a large number of people. (It is very rare to see a manager actually send out an e-mail asking "Was this resolved?" Generally, un-followed-up e-mail is simply forgotten.)

    A system for followup could be partially automated.

    1. A checkbox exists when writing an e-mail. If turned on, it tells the e-mail client, locally, that you want to make sure this e-mail has been followed up. I'll call this a "follow-this-up", or FTU, e-mail.

    2. After an FTU e-mail has been sent, the e-mail client remembers it in a list. Once an FTU e-mail is sent, a copy of it is placed in a mail folder which I'll call the FTU folder. The user of the e-mail client can open this mail folder at any time to see the FTU e-mails that still have to be followed up.

    3. When the client detects that a recipient of an FTU e-mail has replied to that e-mail, then it provisionally removes the FTU e-mail from the FTU folder. (Probably the e-mail is grayed out in the list but not actually removed.) Making this detection complete and thorough is an interesting problem. The starting point would probably be based on receiving an e-mail from the recipient with an appropriate subject line (e.g. the same subject line prefaced by FW: or Re: or Re[5]:). And in order to increase the effectiveness of this technique, the client might actually maintain a database of previously-used subject lines that are already in the FTU folder, and nag the user if he sends a second e-mail with the same subject, asking him to write a more elaborate Subject line.

    Other starting techniques could include parsing the e-mail's content to see if part of the content matches an FTU e-mail that has been received. Or by utilizing e-mail fields or even implementing a new e-mail field which hopefully doesn't get stripped when the recipient replies.

    This system, then, tries to ensure that after an FTU e-mail is sent, there is either a copy of the e-mail in the FTU folder so the user can see that the recipient hasn't followed up, and the user can follow up with a question; or there is a response from the recipient in the user's Inbox.

    User interface is critical to making this system useful for the user:

    4. When the client sees that an FTU e-mail has been replied to, it presumes that a followup has actually occurred. This obviously may not be true; the recipient may have responded with a joke, or with a followup question, or with "I'll get back to you Thursday". Presumably when you read a followup to an FTU e-mail, a new bar of UI should appear in the client saying to the user "This looks like a followup to an FTU e-mail you sent, which you can view by clicking here [slashdot.org]." Buttons would let the user choose things like "Yes, this resolves my FTU e-mail completely" or "No, I still need a followup", or "I now want my reply to this e-mail to be an FTU e-mail, and not the parent."

    5. I imagine that the FTU folder displays its FTU e-mails in date order, showing the oldest non-followed-up e-mail at the top (colored red after 2 to 4 days or so). The user can mark these e-mail copies as already-followed-up (i.e. it's resolved, no more followup needed, because the recipient saw me in person and resolved it).

It is easier to write an incorrect program than understand a correct one.

Working...