
steve1820 · on Sept 24, 2020

Thanks! I appreciate the encouragement :-)

steve1820 · on Sept 24, 2020

Apologies for that Colin! I'm using Squarespace for my personal blog.

I'll investigate further to make it more readable.

EDIT: spelling

ColinWright · on Sept 24, 2020

Thanks for the response ... it might be fine for people with better eyes, or better monitors, but as you can see from the screenshot, the contrast is very low for me.

Love the work, I've saved the link into my system and will be reading it more carefully later. If you'd care to send me an email I'll send you a link to some draft thoughts I've had about knowledge systems. No obligation, obviously.

Cheers!

steve1820 · on Sept 24, 2020

Haha! Talk about learning something new every day!

steve1820 · on Sept 24, 2020

Interesting points regarding the internal mental model!

I agree, for effective information retrieval we (as humans) need to remember the context/ mindset where/when we consumed the knowledge.

I haven't really thought about this problem. It is definitely something to ponder on.

steve1820 · on Sept 24, 2020

I probably won't be maintaining this unfortunately :-(

You have a good point about interoperable components!

steve1820 · on Sept 24, 2020

Awesome open source project as well!!

Good find.

thawab · on Sept 24, 2020

Hello steve, I like how you are integrating PyTorch and other ML models in your project. Looking forward to it, and I wish you the best. Do you have a Twitter profile or a mailing list? I found only your email and LinkedIn profile on the website. I want to be updated on this project.

steve1820 · on Sept 24, 2020

It was my first time playing around with Electron! Overall pretty decent experience.

What makes you hate it if you don't mind me asking?

selfhoster11 · on Sept 24, 2020

Here's a pretty good HN thread that highlights some criticisms of Electron: https://news.ycombinator.com/item?id=18733989

steve1820 · on Sept 24, 2020

Thanks, reading it now. Good points on memory usage and latency.

wraptile · on Sept 24, 2020

Not OP, but my main gripe is that it fails to integrate with native environments. For an efficiency-oriented note-taking app, losing the efficiency edge that native environments provide is just counterproductive.

steve1820 · on Sept 24, 2020

Vaporware is better than going nowhere! (Get it...noware...haha).

Congrats on getting started.

I agree with Obsidian - I think that most people forget the maintenance time it takes to build a lifelong Knowledge Management System.

I like your idea - document similarity is a well known area in ML.

Feel free to take my Chrome Extension and use the parts where it tracks key paragraphs in an article (using a user's click/ hover/ attention behaviour) and use that as the corpus for your ML similarity models.

Intuitively it makes more sense to run document similarity on key points/ paragraphs than the whole web page.

If you want the whole web page though, there's code in the Chrome Extension that uses Mozilla's readability lib (https://github.com/mozilla/readability) to purify the web content.
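To illustrate the idea of comparing documents by their key paragraphs rather than whole pages, here is a minimal sketch using plain bag-of-words cosine similarity. It is not the code from the Chrome Extension or any particular ML model; the function names (`termFrequencies`, `cosineSimilarity`, `keyParagraphSimilarity`) are made up for this example, and a real system would likely use TF-IDF or embeddings instead.

```javascript
// Hypothetical helper: lowercase the text, split on anything that is not
// an ASCII letter or digit, and count how often each token occurs.
function termFrequencies(text) {
  const counts = new Map();
  for (const token of text.toLowerCase().split(/[^a-z0-9]+/)) {
    if (token.length === 0) continue;
    counts.set(token, (counts.get(token) || 0) + 1);
  }
  return counts;
}

// Cosine similarity between two term-frequency maps.
// Returns 0 when either side is empty or no terms are shared.
function cosineSimilarity(a, b) {
  let dot = 0;
  for (const [term, count] of a) {
    dot += count * (b.get(term) || 0);
  }
  const norm = (m) =>
    Math.sqrt([...m.values()].reduce((sum, c) => sum + c * c, 0));
  const denom = norm(a) * norm(b);
  return denom === 0 ? 0 : dot / denom;
}

// Compare two articles using only the paragraphs the reader engaged with
// (assumed here to be arrays of strings collected elsewhere).
function keyParagraphSimilarity(paragraphsA, paragraphsB) {
  return cosineSimilarity(
    termFrequencies(paragraphsA.join(" ")),
    termFrequencies(paragraphsB.join(" "))
  );
}
```

Feeding in only the tracked paragraphs keeps navigation text, ads, and comments out of the comparison, which is the intuition behind the suggestion.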

dx87 · on Sept 24, 2020

Thanks for the tip on the readability library. I don't have much experience with webdev, so my extension was just saving a local copy of whatever was returned every time the browser made a request. I should be able to cut down on storage space if I can use the readability library to skip saving things like trackers and images.

steve1820 · on Sept 23, 2020

Thanks man! I really appreciate it :-)

steve1820 · on Sept 23, 2020

This blog is a summary of a fun 1 month adventure I had with Knowledge Management Systems and building a POC that I thought had potential. It was inspired by so many of the products I see on Hacker News.

I’ve open sourced all the code + written down some notes on my insights/ architecture. Just a warning, the code leaves much to be desired as this was a mini project over 2-3 weekends.

At the moment, I have 3 key layers of the application.

The first is a Chrome Extension, which handles data collection. It tracks all sorts of data such as attention, time on page, and scroll/ click/ hover behaviour. It then sends this data to the Electron app.

The Electron app receives this data through an API and saves it locally using PouchDB (the data structure is compatible with Apache CouchDB and thus allows easy cloud storage).

Within the Electron app, I use an Express/ NodeJS web server to expose endpoints that serve the front end and receive data from the Chrome Extension.

For the front end I use ReactJS.
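As a rough illustration of the data flow described above (extension event in, per-page document saved locally), here is a small sketch. Everything in it is an assumption made for the example: the payload shape `{ url, type, durationMs }`, the `recordEvent` name, and the use of a plain `Map` as a stand-in for the local store. It does not use the Express or PouchDB APIs and is not the project's actual code.

```javascript
// Fold one tracking event into the per-page document held in `store`.
// `store` is a plain Map standing in for the local database (assumption).
// `event` is a hypothetical payload: { url, type, durationMs? } where
// type is "time", "click", "hover" or "scroll".
function recordEvent(store, event) {
  if (typeof event.url !== "string" || event.url.length === 0) {
    throw new Error("event.url is required");
  }
  // One document per page, keyed by URL (similar in spirit to a
  // CouchDB-style document with an _id).
  const doc = store.get(event.url) || {
    _id: event.url,
    timeOnPageMs: 0,
    counts: {},
  };
  if (event.type === "time") {
    // Accumulate time spent on the page.
    doc.timeOnPageMs += event.durationMs || 0;
  } else {
    // Count every other interaction by its type.
    doc.counts[event.type] = (doc.counts[event.type] || 0) + 1;
  }
  store.set(event.url, doc);
  return doc;
}

// Usage: an API handler would call this once per incoming event.
const pages = new Map();
recordEvent(pages, { url: "https://example.com/etl", type: "time", durationMs: 1000 });
recordEvent(pages, { url: "https://example.com/etl", type: "click" });
```

Keeping the aggregation in a pure function like this makes it easy to test separately from the web server and the database layer.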

If you’re interested, please read the full article! There are lots of videos of the application in action.

TrailMarker · on Sept 24, 2020

Built something like that 5 years ago https://twitter.com/MindDriveCo/status/668518135880916992 I have been working ever since to address many of the issues discussed in this thread, and a lot more. I'm on the way to turning Engelbart's Conceptual Framework https://www.dougengelbart.org/content/view/138 for Augmenting Human Intellect into a Practical (meta) Framework. "Go Meta, Young Man" and join the effort to build a Kernel for Open Collective Intellect on the Decentralized Web. It is not a commercial venture; the aim is to contribute to Web 3 something comparable in impact to Ward Cunningham's Wiki, something that has the potential to truly change the world, as Ward Cunningham's idea of the Wiki did.

Primarily Enduring, Open, Co-evolvable, Bootstrappable, metadesignable, promoting edge-user autonomy, unenclosable collaboration, and much more

https://www.researchgate.net/publication/334126329_Weaving_a...

steve1820 · on Sept 24, 2020

I'll have a read...very interesting!

steve1820 · on Sept 24, 2020

Forgot to add: It also tracks what you click so it can build relationships between articles/ blogs/ anything else you read on the internet.

For example, if I’m reading an article about big data/ ETL pipelines and then I click on a link from within the article to a resource on machine learning, this relationship should be tracked and digested.
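The relationship tracking described above can be pictured as a directed graph where each edge means "a link on page A was clicked and led to page B". The sketch below is only an assumption about how that might be modelled; the `LinkGraph` class and its method names are invented for this example and are not taken from the project.

```javascript
// Directed graph of "clicked from A to B" relationships,
// with a click count stored on each edge.
class LinkGraph {
  constructor() {
    // source URL -> Map(target URL -> number of clicks)
    this.edges = new Map();
  }

  // Record that the reader followed a link from one page to another.
  recordClick(fromUrl, toUrl) {
    if (!this.edges.has(fromUrl)) {
      this.edges.set(fromUrl, new Map());
    }
    const targets = this.edges.get(fromUrl);
    targets.set(toUrl, (targets.get(toUrl) || 0) + 1);
  }

  // Pages reached from the given page, most-clicked first.
  linkedFrom(fromUrl) {
    const targets = this.edges.get(fromUrl) || new Map();
    return [...targets.entries()]
      .sort((a, b) => b[1] - a[1])
      .map(([url]) => url);
  }
}

// Usage: the ETL article led to a machine learning resource.
const graph = new LinkGraph();
graph.recordClick("https://example.com/etl-pipelines", "https://example.com/ml-intro");
```

An adjacency map like this converts directly into the node and edge lists that graph visualisation libraries generally expect.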

ydj · on Sept 23, 2020

This is neat. I had a similar idea of tracking what we read online in the browser. In addition to just recording time spent and links followed, I would also archive every article read to automatically build up a personal library of the articles themselves. Additionally, lightweight note taking on the articles themselves, à la hypothes.is

Unfortunately a new baby has drained all my time for such pursuits.

Hope you develop your concept further!

steve1820 · on Sept 23, 2020

Awesome that we came up with similar ideas!

So right now, the Chrome Extension tracks all link clicks as well so the visualisation actually builds relationships between articles/ blogs/ anything else you read on the internet.

I also had another idea: once we build our Knowledge Maps, we should be able to compare and share them with one another.

For instance, I'd love to see what other software engineers around my age/ experience are reading and the insights they are drawing from articles.

Obviously some sort of security/ privacy mechanism will need to be implemented as well.

xrd · on Sept 23, 2020

Do you then sync to a real CouchDB? Or, do you use PouchDB server on the backend, or otherwise?

I'm interested in understanding how you eventually translate to Graph4j and how you do authentication. Where can I look for this?

This is really, really cool that you open sourced this. Thank you.

steve1820 · on Sept 23, 2020

No worries! I'm always leeching from the open source community so I thought I'd give back haha.

The visualisation library I'm using is by Ant Design (Alibaba).

You can see some examples below from the documentation.

https://g6.antv.vision/en/examples/gallery

Re syncing with real CouchDB - I haven't implemented that yet but from what I read, it's definitely possible with PouchDB.

I have a bunch of Express APIs that expose PouchDB so I can do basic CRUD operations such as creating resources etc.

xrd · on Sept 24, 2020

It is definitely possible to sync with CouchDB. I've just found that the uncertainty around how to do authz/authn (CouchDB now recommends that you do this in your own proxy layer) and the difficulty of making sure the entire process works make debugging tricky. I've tried a variety of NodeJS+CouchDB and PouchDB-server setups, and nothing gives me a solution that I don't have to babysit quite a bit.

ErikBjare · on Sept 24, 2020

This is a really cool use of browser history data. I'm building ActivityWatch [1] and have been thinking of building something similar on top of it.

[1]: https://activitywatch.net