Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 8 years ago.
Improve this question
I'm looking for a in house geocoding tool to geocode millions of address. I've tried on TIGER database, but it got only about 60% address rooftop. There are addresses way far away from the actual address. My needs are:
1. fast enough to process those millions of address in days
2. rooftop accuracy - shouldn't be too far away (I'll say less than 100 foot mistake)
3. in house service - so it should be free to our internal staff
4. ideally open source, but it's ok to have a one time cost to set it up
Currently I'm looking at application level infrastructure, and I'm open to dedicate map server or something like that. I just don't have enough information to start researching.
Feel free to throw me any ideas, thoughts, comments. I'd love to hear them!
There are two pieces to this problem.
the geocoder and how well it parses addresses and matches them to the reference data set.
the reference data
For 1, I have extracted the parser standardizer from PAGC into a postgresql stored procedure (which is OpenSource) and then built a couple of geocoders using that as the heart of the engine.
For 2, and the accuracy that you are looking for, you will likely need high quality commercial data like Navteq or parcel data. Tiger is good for the cost to get you near the location but Title 13 requires Census to fuzzy the address ranges to no single address can be matched to a Census form. So as you found out, Tiger will not do the job.
I have written a lot of geocoders and have one that will work with Navteq and should give you results that are close to your requirements. Check out http://imaptools.com/ and contact me if your interested.
Related
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
This question does not appear to be about programming within the scope defined in the help center.
Closed 1 year ago.
Improve this question
I have to write the "assumptions" part of a pentest report and I am having trouble understanding what I should write. I checked multiple pentest reports (from https://github.com/juliocesarfort/public-pentesting-reports) but none of them had this paragraph. Also I found this explanation "In case there are some assumptions that the pen-tester considers before or during the test, the assumptions need to be clearly shown in the report. Providing the assumption will help the report audiences to understand why penetration testing followed a specific direction.", but still what I do have in mind it is more suited for "attack narative".
Can you provide me a small example (for one action, situation) so I can see exactly how it should be written?
I would think the "assumptions" paragraph and the "Attack narrative" paragraph are somehow overlapping. I would use the "Assumptions" paragraph to state a couple of high level decisions made before starting the attack, with whatever little information the pentester would have on the attack. I would expand on the tools and techniques used in the "Attack narrative" paragraph
For example an assumption could be:
"The pentester is carrying on the exercise against the infrastructure of a soho company with less than 5 people It is common for soho companies to use consumer networking equipment that is usually unsecure, and left configured as defualt. For this reason the attacker focused on scanning for http and ssh using a database of vendors default username and passwords"
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a book, tool, software library, tutorial or other off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 8 years ago.
Improve this question
I am working on getting the performance parameters of a tcp connection and one these parameters is the bandwidth. I am intending to use the tcp_info structure supported from linux 2.6 onwards, which holds the meta data about a tcp connection. The information can be retrieved using the getsockopt() function call on tcp_info. I have spent lot of time finding a good documentation which explains all the parameters in that structure, but couldn't find one.
Also I tested a small program to retrieve the values from tcp_info for a tcp connection where I found the measured MSS values for most of the time as zero.To make long story short-Is there a link to follow for which has complete details ontcp_info and also is it reliable to use these values.
Here is a fairly comprehensive write-up of the structure and use of the linux tcp_info by René Pfeiffer but there are a couple of things worth noting:
The author needed to look at these data repeated over time because there are no aggregate stats in that structure.
The author directs you to the tcp.c source as the final authority on the meaning of any of those data.
I'm not sure what you were hoping to get from the Maximum Segment Size, but expect you thought it meant something else.
If you are truly interested in exact measurements of bandwidth you need to use a measurement device which is outside the system being tested as even pulling the ioctls will affect the phenomenon you are interested in knowing about. A passive wire sniffer is the only way to get truly accurate results. Finally, depending on your application, "bandwidth" is a really broad umbrella which flattens many measurements (e.g. latency, round-trip-time, variability, jitter, etc.) into one category.
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
Can you suggest sites where I can generate dummy data based on my requirement for testing purposes of my project?
Note: I need a dummy data for usage of VMs and physical servers in terms of memory, CPU, disk, I/O utilization in percentage. Is there any site which provides utility to generate this kind of data?
Check out InfoChimps, they may have the sort of data your are after. But if you're just looking for numbers, it should be exceptionally trivial to just generate them yourself.
Maybe you can try http://www.generatedata.com/#generator
Obviously late to the discussion but in case anyone finds their way here. The baseball database contains some moderately large datasets (160,000+ records in the Fielding table, I believe).
Check out my MySQL Datagenerator. Perhaps that is what you are looking for...
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
This question does not appear to be about programming within the scope defined in the help center.
Closed 8 years ago.
Locked. This question and its answers are locked because the question is off-topic but has historical significance. It is not currently accepting new answers or interactions.
I'm thinking of forming a Hackers Club at work. My idea is that we would meet monthly and at each meeting one member would present an interesting hack he had created. (The hacks presented wouldn't necessarily have to be software hacks; they could also be the sort of things you read about in MAKE magazine.)
There would also be ANSI standard pizza, veggie pizza, and beer and pop available for socializing afterward. I'm even thinking of calling the club "TMRC" even though it will have nothing to do with model railroads.
Has anyone ever tried doing something like this or have any advice?
We do this at the office. I call it 'Developer Fight Club'
Usually do challenges of varying difficulty and compete against one another.
At the end of it, we go over our solutions, do code-reviews and discussions, and then use either benchmark results or other people as the deciding factor for who wins.
Typically, the loser has to buy lunch for the winner :)
For ideas of things to do, try stuff from Top Coder, programming questions on Stack Overflow, or even simple "crackme" applications available on different programming sites.
The main rules you'll need to adhere to are:
Make It Fun
Make It Educational Make
Make It Fair
Try to rotate the challenges, so either everyone is really good at the subject, equally bad, or at least mix it up often enough that it doesn't favor one person's skillset too much.
If there are women in your 'hacker' group, consider the advice given in the Howto Encourage Women in Linux. Especially the 'meeting places and times' section.
This is a little beyond what you asked, but there is good info on how to plan for hackers and how to resolve issues among members in a polite, hackerly manner.
Overview:
http://events.ccc.de/congress/2007/Fahrplan/events/2133.en.html
Videos:
http://chaosradio.ccc.de/23c3_m4v_1500.html
http://chaosradio.ccc.de/24c3_m4v_2133.html
My favorite is the Tuesday Pattern:
If there's a scheduling conflict such that no day of the week is good for everybody, just hold the event on Tuesday. No exceptions! Simple and fair :-)
Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
I haven't been to enough of these "live" events to really determine which, if any, are worth the time / money. Which ones do you attend and why?
For conventions, if you're still in university, and can make it to Montreal, Canada, the Canadian Undergraduate Software Engineering Conference (CUSEC) has been extremely enjoyable. See the 2009 site for the next event, and for a take on what previous years have been like, take a look at the 2008 speakers (note: it included Jeff Atwood).
I attend CUSEC primarily because our software engineering society on campus makes a point of organizing a trip to it, but also because of the speakers that present there, and the career fair.
I used to belong to my local Linux User Group which I co-founded but I treated it more as a social event than anything else but obviously a social event full of geeks is still a great way to get a great debate going :)
Conventions and the like I've not got much out of other than being pestered by businesses who can offer me nothing that is apart from a bunch of Linux and Hacker ones where I've met loads of people who I consider friends offline, again great for the social aspect but pretty worthless to me in other respects.
That's not to say I never got any business out of attending various events it's just that treating them as social occasions meant any business that did come my way was a bonus so I never left an event feeling like it was a waste of time.