| Good work. Here are couple suggestions: - The long list of articles at the bottom of the page seems excessive; why not just show 10 most recent? expand when needed; it just clogs the UI - Good HTML should read like any other good source code, if the logical, visual, and functional pieces of HTML are broken into disorganized pieces it will be hard for a new member of your team to keep up or to maintain it - CSS doesn't have any comments [EDIT: this is just for showing your code - if you want good feedback let me understand what you are trying to do - geez] - You should do some simple clustering or at least use some epsilon in you document similarity matrix to distinguish if an article is new or not; if same or very similar article shows on three different web-sites will it have triple effect on the prices? For example: http://www.insidefutures.com/articles/article.php?id=345842 http://www.mrswing.com/articles/Grain_Market_Analysis_from_J... - Instead of showing 10 most recent articles it's better to show N-most recent articles with S-most recent trend; this might be difficult to implement but seems more helpful for making a decision; otherwise the user has to do extra work to figure out: "this is an old news, I know that two days ago they thought otherwise, but now there is new trend" - This is not my domain of expertise and I'm not sure how traders work but this might be useful in healthcare so please keep working on it keeping in mind that this could work much better in different domains [EDIT:] Try shared neighbor distance method. Whatever you use now for similarity feed it into Jarvis & Patrick method and try different NN. This will take care of outliers and variable densities throughout heterogeneous clusters. |
Am I missing something? How is this relevant to the functionality of the site?