Open source online document editor - document-management

Can somebody tell me some open source projects that implement document management online?
I need to upload document (pdf, docx,fb2), convert it to doc and allow user to edit it online and then convert back to pdf.
Images and formatting should be preserved.
I found teamlab, but it is not free and also I don't need collaborative editing.
Thank you.

It depends what your requirements are when you say 'online editing'.
You could use Nuxeo including the Nuxeo Drive extension, to enable users to edit the files from the remote repository locally (transparent in the sense the user does not take the step of downloading or uploading files), and their changes are then synced to the repository.
The user can edit the .doc file locally using OpenOffice (or MS Word if they have it).
If the requirement is strictly 'online/web only', you could convert the PDF to RTF via an OCR engine such as Tesseract, and then use one of the many WYSIWG inline editors, and connect this to Nuxeo as an edit button using their extension framework. There is an existing tesseract-ocr extension available at the Nuxeo source repo.

Related

How to open a saved Google Onhub diagnostic report?

I have a saved Google Onhub diagnostic report. With Notepad I can't see the whole file, because it's encrypted. A Hex-Editor won't open it neither.
Can someone tell me if there is a program to open it correctly?
Thank you!
It's a protobuf encoded file, containing an archive of files and other data entries.
There's a Go project on github that decodes it, and even if you don't run their Go code, you can use the protobuf schema that's available in that repository to decode the report in whichever environment you choose using the protobuf bindings available in the official website

Bulk Upload PDF Forms to AEM Repository

We recently began using Adobe LiveCycle and Adobe Experience Manager. We're not using AEM for our web site, however. We're just wanting to use the Forms Portal portion of it to allow us to organize our forms, tag them, etc, then provide a searchable interface.
We have several hundred pre-AEM/LiveCycle PDF forms. Rather than manually uploading one PDF file at a time, we would like to do a bulk upload into the repository.
If I remember correctly, I saw someone do this using CRXDE Lite, navigating to /content/dam/formsanddocuments, then dragging the documents in. However, when I try that, it just opens up Acrobat Reader to display the PDF.
Any help would be greatly appreciated!
I'm not sure about drag/drop in the browser (never tried it), but there are a couple of methods you could use:
Curl / Bash
If you're happy scripting this in Bash, one method is to use curl to upload the file, along the lines of:
curl -F"./*=#form1.pdf" \
-F"./*#TypeHint=nt:file" \
http://admin:admin#localhost:4502/content/dam/forms/form1
For example, you could create a script to loop over all the PDFs in your directory & upload them one at a time.
VLT
Another option would be to add create a content package with Maven, build it and install via CRX Package Manager.
Or you could use the Filevault (VLT) tool that Adobe provide as a VCS-style link between your file system & the repository — checkout to a directory, add the PDFs in and then do a vlt add; vlt ci to push them back into AEM.
WebDav
AEM supports mounting the repository via WebDav, so that you could drag the files in using Finder/Explorer.
It can be slow, but if you're just doing a one-off dump of files into AEM, it could be an option.

How to use AJAX File Browser for editing PDF files

AJAX File Browser works great with Microsoft Office documents. However, I do note that the same does not work well with PDFs.
Currently in our application, 95% of the documents being reviewed/edited are PDFs. So is there a way I can make the PDFs work with Ajax File Browser?
Basically I'm looking at a solution/product which will help us with editing the PDFs using Adobe reader/professional and storing them directly on the server.
The only way to open the the PDF document is using Java applet (or potentially other method with unrestricted access to local file system). The Ajax File Browser is provided with Java applet which mount mounts local file system and opens document for editing. The Java applet automatically engaged for non-MS Office documents or if MS Office is not found. This works on Windows, OS X and Linux.
You can find info about when the Java applet is engaged here: http://www.webdavsystem.com/ajax/programming/opening_ms_office_docs

Is there a program that lets me edit web files with a native editor?

Before I attempt to program the following function myself, I wonder if something already exists.
What I would like to do is click an edit link on my website for a given document, and have that document launch in the native editor on my local machine (via a temporary file mechanism).
When I save the document in the native editor, the document is HTTP PUT back to the website. This can be accomplished by watching the file for writes, or watching the editor process for exit.
This way I can more easily edit documents on the web (instead of going through the download / edit / upload cycle).
My design would work as follows:
Register .webedit files on the local machine.
When a .webedit file is downloaded, launch webedit.exe with the file.
The file contains a URL (http://server/document) which is checked against a security database to ensure we're only opening allowed URLs.
The URL is downloaded to a temporary location.
The temporary file is launched in the native editor.
The file is watched for changes, and uploaded (HTTP PUT) on change detection (or when the editor is closed, if it's not a single-instance multiple-document editor).
Lots of FTP / SCP GUIs have this type of functionality, but I have not been able to find it for the web in general, or a shared library that allows you to plug in to this function.
Has anyone seen a program that does this?
SharePoint works like this.
It's great for managing shared documents in corporate environments.
Users can even checkout/checkin documents & the features are very extensible..you can customize pretty much anything if you know how.
Edit:
Since you're on Linux..i've heard that Alfreco is a great alternative.
I've never used it, but I know a couple organizations using it instead of SharePoint.
It integrates with Microsoft Office as well.
Also, it will definitely be cheaper.

Advanced image editing off the web

I'm building an app in ASP.NET that will store some pictures of objects. The pictures will be uploaded by suppliers and downloaded by subscribers. In between, they will have to be edited before becoming available to subscribers.
The editing involves creating a cropping path tightly around the object in the picture, in which some advanced desktop image software will have to be used I suppose.
My problem is in exchanging pictures between my ASP.NET app and the desktop software in a manner that is easy and transparent for the user.
I've done some thinking and I've come up with:
- Manually downloading and uploading the image (Not much user friendly...)
- An image editing program that can upload to a web service (Haven't found yet...)
- Develop a plug-in for an image editing program (Too advanced...)
I'd appreciate any suggestions you may have, thank you!
It sounds like you need some automation to move files between the web server and a file share. I am assuming that the number of images that need to be processed is pretty large, because if it's not, then the overhead of downloading/re-uploading each would not be that much.
So do the following:
1) Create an API for your web app that lists files that are available, or new files since some date/time, or files that have been marked as "new". The API should probably also allow marking a status on them (so you can tell it when you've finishing pulling something down, and it won't be offered again) if you don't want to trust date/time as an indicator of it being new.
2) Write an app (non-web) that runs on a schedule and uses this API to automatically download files to a shared filesystem area in your local network, and marks them as "downloaded"
The app should also monitor these files (the ones it downloaded & saved to your local share) for changes, and if changed, upload them back to your web app. To do this you may need to keep a database of filenames and modification dates/times.
This shouldn't be too hard to write in whatever language you are using for your web (assume c# or vb). By "API" I just mean, a web page that provides a list in a standardized format (e.g. json) that you can parse with your automation application, and another page that allows posting the file back for re-upload.
I'm assuming that the web server is not your own, or generally, you can't simply have it save the file uploads directly to some area where your image editors can access them. Otherwise you could just do that.
Meanwhile I came out with another possible solution.
I'm thinking of having our own windows app on the editor's computers. This app will be associated with a custom extension. When an editor downloads a file (with this extension) for editing, it will be opened in our application which in turn will open the image in some editor program.
This app will be monitoring the files for changes, and in such case, it will upload these images.
Any thoughts on this?

Resources