For my project the default dictionary provided by the Sphinx is not sufficient.
I need to use another custom dictionary along with the provided dictionary.
Now my question is that is there any way of specifying multiple dictionary files to Sphinx or do I need to combine both the dictionaries into a single big dictionary file?
Thanks in advance :)
You have to combine dictionaries into single one.
If you want multiple dictionaries with Sphinx, what I did was to make my program delete the previous dictionary and write another one to the same file path with the different words you want to use. This can be done back and forth as many times as you like to give the impression of multiple dictionaries.
Related
They look like this:
abaft
abbreviation/M
abdicate/DNGSn
Abelard/M
abider/M
Abidjan
ablaze
abloom
I am using this kind of dictionary with Node.js application, but I will need it to be smarter. Specifically, I want to remember occurrence probability of every word based on already processed text. I'd like to save this information in existing .dic file - but how to do that without making it invalid?
Is there any comment syntax that would allow me to store additional data next to the words in file? Such that normal dictionary parser will ignore it?
I'm building a web app that's going to support multiple languages. For the moment, the words are hard-coded in English in the HTML and javascript. I want to use objects that contains the English word as the key and the word to be displayed as the value and have this dictionary object populate the page at runtime. But my question is not about client-side issues.
What's the best way to store and maintain this dictionary on the server. So far, I thought of a database table with columns for the English word and rows for the values to be displayed. I would then load an entire row in an anonymous type that I'd serialize in json and send the client. I think it'd work but I'm wondering if that's the best way to do it so that the dictionary will be easy to maintain.
If you've had some experience dealing with internationalization then please let me know if you have some suggestions.
Thanks.
You can use Resouce files for multiple language support for asp.net web. It is THE BEST method to use for multiple language support in asp.net. http://msdn.microsoft.com/en-us/library/fw69ke6f%28v=vs.100%29.aspx
what has to be done for including Localization Concepts?
may i get the source code for each & every steps to be done?
is it necessary to include the resource files?
what has to be entered in the String & in the Value column while creating the Resource File?
What has to be done for including Localization Concepts?
The first step is to actually understand these concepts... You may want to read about it here:
Effective Strategies for Localization in .NET
May I get the source code for each & every steps to be done?
Oh dear... Such question is too broad and could be closed as not answerable. You would found plenty of examples and tutorials online. Just read them and ask specific questions about things you do not understand. But first, please try to search these questions using your favorite search provider (most of them were answered, some probably multiple times).
Is it necessary to include the resource files?
It is not necessary. However, for most of applications it is both easiest and fastest method. And simply the best method. And the only one which is really easy.
What has to be entered in the String & in the Value column while creating the Resource File?
In the first column you would need to put resource identifier - unique key that would identify your translatable string. Depending on Localization method you would choose, it could have different forms (for implicit Localization it is best to keep it as control_identifier.property_name, where control_identifier is a string you put in ID attribute and property_name is name of the property you want to translate, i.e. lblName.Text). In the Value column you need to put your translatable string.
I've scoured the net and cannot seem to find an appropriate example so I thought I'd ask...
(Btw, much of this is new to me- not all, just most.)
Problem: trying to convert a bio/python nested dictionary (or xml) of pubmed citation data into a flat (normalized) structure eg, sqlite. Citation data was fetched from pubmed using biopython and was parsed into a dictionary, but can also retrieve as xml if needed.
Not all citations will have all fields/keys and not all fields/keys will have the same number of items (authors, mesh terms, refs, etc...) and understand that this is part of the normalization process.
This is about where my practical understanding ends.
That said, I think the process should go something like this: first remove/normalize all unique fields (those that have 1 per paper eg, title, abstract, date, citation, etc..., but say not affiliation as that would be linked to first author). Papers with no abstract could be filled as null?
Then move on to, say, authors and create a separate table again using PMID as the fk and then do same for the various other fields/keys/items in separate tables eg, mesh headings, EC numbers, ref, etc...
Is there a way to do this that removes (pops?) keys/items from the master dictionary so that I can visually see what's been done/needs to be done (obviously leaving the PMID)?
Again, apologies in advance if I'm asking a blindingly obvious question to the initiated- and I do understand that you can't fit a nested structure into a flat space- just looking for the least boneheaded way of going about this and hopefully one that will allow me to make sure that everything was properly captured.
Many thanks,
chris
A quick question -- if you already have the data in XML, why are you normalizing it into a SQL format? Why not just use the raw XML? Berkeley DB XML is a library (like SQLite) that links into your application. There is no separate server to install or maintain. The library allows you to store and query XML data using XPath or XQuery. It's very fast, has a small footprint. is transactional, recoverable and highly reliable. It has HA features as well, if that is required.
Keeping the data in XML should simplify the whole data import process and still allow you to query the semi-structured data.
I need to add dictionary facilities to an Asp.net MVC app. Does anyone know a library that I could use? Where can I get word definitions from?
Any help is appreciated.
it depends on what you want to do.
if you're trying to add a reference of an actual dictionary of words to use for your application then you'll have to create your own to define the words that you want to use.
if you're trying to find the library that contains the data type dictionary then you should try
System.Collections.Generic, or System.Collections.Specialized
If its still relevant for you, It's not a 'library' but on Dicts.info site you can find raw dictionary files you could use inside your application.
You can also find bilingual dictionaries for various languages or even categorized vocabulary, and even more...
Watch out to respect their respective licences.