Friday, January 27, 2012

Real Time Search Of Live Content On Limited Memory Lossy Dynamic Data WebSites

Degradata Search = Degrading Data Search, or
Real Time Degrading Data From WebSites Exhibiting Rapid Relevance Decay Such As Image Boards, Chatrooms, Blogs, and Status Updates.

Use cases:
Users seek to discover ongoing live conversations in which their college, city, area code, or names are being discussed.

Firms seek to discover live conversations about their brand to engage their users.

Reporters seek to take the public pulse on a particular subject as research for a news story.

The primary rapid decay datasource I'd like to mine is 4chan, but other examples include tinychat, craigslist, facebook, twitter, quora, reddit, and hacker news.  It is important to note that 4chan, tinychat, and craigslist exhibit different forms of data decay than the remaining examples, that is; their data dissapears eventually.  Not so with twitter, facebook, and reddit; which experience only relevance decay without the added complication of data dissapearance.


4 chan acts like a limited memory queue of bumpable image and username optional message board threads that 404 as they fall off the queue.  A search result for a thread on 4chan is only useful within the window the thread has active contributors.  One of the boards on 4chan particularly ripe to have it's data mined is its /soc/ board that often sees threads related to people linking up in their area by sharing contact info, area codes, and self descriptions.  This and other oppurtunities for interaction have elevated the refreshing of the soc board to a modern cultural pastime for under 30's.  It's users would benefit from real time alerts when specific terms or usernames appear, the ability to see all current occurances of a search term across the site, and a daily digest of discussions involving those terms to keep them current.  The effect would extend the social aspects of the site into a slightly more formalized follow network, of sorts, with the alerts as follows, a find related alerts function for discovery, and the ability to selectively make profile and a subset of alerts public.

The topic of a particular thread is not always simple to identify.  Threads can be derailed on purpose, ask spiderman, though more commonly they simply devolve to how contributors choose to interact within them.  A naive bayes classifier applied at the thread level to return a hypothetical topic ought to satisfice this requirement, so hereon I proceed with the validity of that hypothesis as an assumption.

The pain point it solves: tbc

Tuesday, January 24, 2012

Hyperspeed Tcp

In respone to Google's post on speeding up tcp:
On the client:
Start a genetic algorithm evolving to predict the responses it will recieve when posting an initial request for tcp connection to a url.
The genetic algorithm is initiated as the request is sent out, seeding its evolution with locally stored reponses to prior tcp initiating requests to that url.
As packets are recieved, the genetic algorithm learns from it's mistakes. When the final packet is recieved, the genetic alhorithm stores this optimal solution and a snap shot of its processing at the moment before it was recieved.
When the user sends a subsequent initiating request, this snap shot is stepped into, allowing the genetic algorithm for that user-to-url connection to continue evolving in discrete time slices as users surf their most often visited websites. 
The result is increased transmission bandwith and the ability to entangle inverse genetic algorithms on both the sending and recieving hardware that can set up direct connections top> one another in advance of actual requests being sent, based on prior surfing habits. 
Each integrates the other's champions and differentiates their predictions from the other's expectations to influence it's own evolution.
This can be thought of as two people building a puzzle together on opposite sides of a glass pane, lining up where they expect pieces to fit, anticipating theg moment when the pane is removed, and all the pieces click together: the moment the request is sent.
The server examines theg consensus prophecy of the twin oracles, agggnd if it correctly predicts the request it recieved, and the response it is preparing to send in three packets, it responds with a single packet telling the client to proceed with the  assumptions of it's oracle.
Subsequent requests begin to take on a branching reality-like possibility space. Consider a sequence of oracle pairs numbered 1 through n owning puzzle images that, when composed into a movie, correctly capture a user's interaction with the server, determined expirimentally.
Subsequent sessions can reveal the most well trodden paths a user is known to travel, and eventually servers can respond to requests with a packet containing a movie id and a scene number for the user to load,skip to, and start playing from local memory.
The idea is to employ unused cpu cycles on both sides of the bottle neck to increase the concentration of the data transmitted through it.
As this concentration approaches an optimal order of magnitude, the employment of knowledge of user intent in the streamlining and catalysis of user experience is maximized; differing to free will when their prophecies are proven wrong.
submit to reddit

Sunday, January 22, 2012

Answering YCombinator's Call To Arms

About Me:
Transwoman techno-entrepreneur, software consultant, copyleft content creator, emergent technologies researcher, seeker of social justice, and entertainment entity.    


I will describe the architecture for an invention facilitating a direct consumer to content creator economy focused on the creation, sale, and management of shares of stock in creative professionals who fund their projects with funds derived from selling of said shares, with the condition that creators realease their content under a copyleft liscence, making it free and open source.

The market for a creators stock becomes a pay what you want subscription/crowd sourced patronage model, with stocks offering microdividends of future revenues raised so as to give early large investors economic incentive to identify talent and launch careers while gradually regressing to a platform for fans to interact with and fund new works by the object of their fandom.

Brief Start Up Outline:

This article is my prior art to protect from future patents, establish  an open source project aimed at developing, and propose a non-profit start up to manage: a virtual stock market system for trading in shares of content creators as firms.  Rather than purchase a cd or dvd, people purchase shares in creators they like, who use the capital to produce open source conten t.

Shares entitle their holders to portions of revenue from future shares sold, such that for every share bought the creator is also given a share, such that they are garunteed a majority stake in themselves, the business entity.  As creator fame increases, and they are picked up for  closed source projects, the invention  transforms into a talent discovery tool.  The rest of the invention's architecture is drafted as answers to YCombinators call to arms.

"What are people going to do for fun in 20 years instead of what they do now?"

To identify what will be different, we identify what will be the same, and study how its interfaces will evolve. A brain comuter interface will usurp screens, ai will produce human level creative content, people will upload their minds, virtual versions of a person will interact with virual versions of other people and engage in commerce, conversation, and creativity with only their human owner's oversight.

Human activity will be focused on training ai's, producing creative content only their minds could produce, maintaining their automation systems, enjoying the entertainment of the age, flesh and blood relationships, learning, and working on the frontier of science.  Everything will be gameified, reality will be augmented, and tasks unfit for ai will be queued for human completion.

My content creator stock market invention serves the future need to be able to profit from ones creative content, and allows use of existing piracy methods as distribution channels in order to subsume hollywood's role in the creation of content; and is the initial invention of the start up being proposed, a first blow in the war against Hollywood, replacing what it currently does with an alternative that employs the collaborative advantages of the web and the pressure for innovation of a free market.

The second invention is a system for mind uploading wherein unique chatterbots are generated for each user, from whom it will learn by observing the users coversations with other user bots or a masterbot that is trained based on all interaction had between users and other users bots.  Badges and achievments gamify user bot training activities, rewarding users for training their bots until they are turing competitive.

The bots form the basis for a next
generation social network and social discovery network, as users dispatch their bots to converse with other user's bots, and instruct them to identify potential candidates for user to user communication based on success of those automated conversations, thereaby automating the social nerworking busy work, allowing users to view the coversation other users, or other user's bots, had with their own bots as if they were email, which the invention will also automate and subsume.  

The third invention is  an artificially intelligent programming language that evolves its syntax, grammer, ontology, dictionary, and  other programming language features and/or constructs using human guided genetic algorithms to develop a universal compiler capable of translation between, and ability to interpret, any arbitrary machine codes, pseudocode, or programming languages.  

These three inventions address the potential ways people will
be creative, interact socially, and perform meaningful work (by training, using, and correcting the artificially intelligent programming languages) in the future, and serve up an ad hoc death blow, or at least one intended to maim, to an ultimately doomed hollywood cartel.

"New ways to produce shows:"

Using described chatbots to generate screenplays, and the stock market to crowdsource tasks like effects, editing, cgi, sound scoring, etc.

"New ways to distribute shows:"

The ai programming language will encapsulate video as program modules and function as a swarm intelligence existing on a viral botnet with no off switch, whose code for proliferation is itself evolved to increase its ability to infect, minimize detection, and survive removal.

"New media (e.g. games) that look a lot like shows but are more interactive:"

The chat bot mind uploader will be integrated with the creator stock market to gamify the creation of content required to produce a library of user video clips that can be tied togther to create completly new videos, and facial expressions and speech captured for superimposition on computer generated bodies and backrounds and synchronization with user chat bot conversational databases.

"Some of the best ideas may initially look like they're serving the movie and TV industries:"

The creator market serves as a talent scout for hollywood, the chatterbot integrates with facebook and operates thereupon obsequiously. The artificially intelligent programming language serves as a developer tool for the web but eventually become an alternative internet.